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Preface 



The multiset (a set with multiplicities associated with its elements, in the form 
of natural numbers) is a notion which has appeared again and again in many 
areas of mathematics and computer science, sometimes called a hag (some his- 
torical information appears in the enclosed paper by A. Syropoulos). As a data 
structure, this notion stands “in-between” strings/lists, where a linear ordering 
of symbols/items is present, and sets, where no ordering and no multiplicity is 
considered; in a multiset, only the multiplicity of elements matters, not their 
ordering. Actually, in between lists and multisets we also have pomsets, partially 
ordered multisets. 

Confining ourselves to computer science, we may mention many areas where 
multisets are used: formal power series, Petri nets, data bases, logics, formal 
language theory (in relation with Parikh mapping, commutative grammars, etc), 
concurrency, and so on. In the last few years, the notion has occurred in a 
rather natural way in the molecular computing area. An aqueous solution of 
chemical compounds, swimming together in a given space, without any given 
spatial relation between individual elements, is just a multiset. Actually, this 
chemical metaphor was used several years before the occurrence of what is now 
called molecular computing, as the basic ingredient of the Gamma language 
and the Chemical Abstract Machine (a comprehensive survey of these ideas is 
provided by J.-P. Banatre, P. Fradet, D. Le Metayer). Then, multisets were 
used in relation with DNA computing, especially in the context of computing by 
splicing (H systems): taking into account the number of DNA molecules proved to 
be a very powerful feature of H systems, leading to computational completeness. 

In the prolongation of the chemical metaphor, the membrane computing 
area has recently emerged, as an abstraction of the living cell structure and 
biochemistry: in the compartments defined by a membrane structure, one pro- 
cesses multisets of chemical compounds, denoted by symbols or by strings over 
a given alphabet (up-to-date details about this area can be found on the web 
page http://bioinformatics.bio.disco.unimib.it/psystems, mastered by 
C. Zandron, in Milan). 

In spite of all these “applications” and of many other related topics, the no- 
tion of a multiset has not received any systematic, monographic attention. In 
August 2000, a workshop was organized in Curtea de Arge§, Romania, explicitly 
with the aim of discussing the mathematical backgrounds, the computer science 
and molecular computing relevance of the notion of a multiset, with emphasis 
on multiset processing in the distributed framework of membrane computing. 
The editors of the present volume constituted the program committee of the 
workshop, while the organizing institutions were the “Simion Stoilow” Institute 
of Mathematics of the Romanian Academy, Bucharest, the Artificial Intelligence 
Department of the Polytechnical University of Madrid, the Centre for Discrete 
Mathematics and Theoretical Computer Science of Auckland University, New 
Zealand, and the “Vlaicu-Voda” High School of Curtea de Arge§, Romania. A 
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technical report (no 140/2000) of CDMTCS, Auckland University, was published 
as a pre-proceedings working volume, in anticipation of obtaining improved ver- 
sions of the papers during the workshop, through the interaction of the authors. 
Actually, several new papers were started during the meeting as a result of co- 
operation among participants. Some of these papers were collected in a special 
double issue of the Romanian Journal of Information Science and Technology 
(ROMJIST), published as nr. 1-2 of volume 4, from 2001. Other papers, as well 
as several additional invited papers, are contained in the present volume. 

While the ROMJIST issue is mainly devoted to DNA and membrane com- 
puting, the present volume is focused on fundamentals, on mathematical formal- 
izations, and computer science (including molecular computing) applications of 
the notion of a multiset. Besides the two papers mentioned above, there are 
papers about the mathematical status of the notion of a multiset (S. Marcus, 
S. Miyamoto, T.Y. Nishida), about a theoretical, formal language-like study of 
multisets (by E. Csuhaj-Varju, M. Kudlek, C. Martin-Vide, V. Mitrana, Gh. 
Paun), with the ambition of founding a consistent rewriting Chomsky-like the- 
ory of multisets, as well as applicative papers: multisets and data bases (a rather 
comprehensive paper by G. Lamberti, M. Melchiori, M. Zanella), multisets in 
visual languages (P. Bottom, B. Meyer, F. Paris! Presice), multisets and dy- 
namic systems (W. Reising), multisets and formal languages (J. Dassow). Sev- 
eral papers are related to membrane computing, discussing the relation with 
constraint programming (A. Dovier, C. Piazza, G. Rossi), the power (M. Madhu, 
K. Krithivasan) and efficiency of membrane systems (A. Obtulowicz), a possible 
implementation on the usual electronic computer (F. Arroyo, A.V. Baranda, J. 
Castellanos, C. Luengo, L.F. Mingo), or applications in Artificial Life, mainly 
via extensive simulations on the computer (Y. Suzuki, Y. Fujiwara, H. Tanaka). 

These papers, besides a large quantity of background information (basic no- 
tions and results, bibliography, further developments not presented in an exten- 
sive manner), also contain research topics and problems, so that the volume is 
intended to be useful both to the “users” of the notion of a multiset and to 
those who want to contribute to the development of its theory, with special at- 
tention being paid to the computer science approaches (including the membrane 
computing area). 

Of course, this is only a step towards “FMT” (Formal Macroset/Multiset 
Theory), to borrow one of the syntagmas used as the title of a paper included 
in the volume. Hopefully, the next step will be done by the reader. 

* * * 

We warmly thank all contributors to this volume for their timely and high- 
quality work, as well as A. Hofmann, from Springer-Verlag, Heidelberg, for the 
efficient and pleasant cooperation. 

September 2001 Cristian S. Calude 

Gheorghe Paun 
Grzegorz Rozenberg 
Arto Salomaa 
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Abstract. The aim of this paper is to show that some computational 
models inspired from biological membranes, snch as P systems, can be 
simulated on digital compnters in an efficient manner. To this aim, it 
is necessary to characterize non-determinism and parallel execution of 
evolution rules inside regions. Both these issues are formally described 
here in order to obtain a feasible description in terms of data structures 
and operations able to be implemented in a functional programming 
language. Static and dynamic strnctures of transition P systems are for- 
malised in order to define a bio-language to represent them. Finally, a 
draft of a language for describing a transition P systems is presented. 
It will facilitate the description of transition P systems in terms of sen- 
tences in a high level programming langnage; snch sentences will define a 
program. A process of compilation will parse the program to appropriate 
data structures and will lannch the execution of the simnlation process. 



1 Introduction 

Membrane computing is a new research field, of a biological inspiration, intro- 
duced by Gh. Paun in [4]. This new computational paradigm tries to imitate the 
way nature computes at the cellular level. Any biological system can be viewed 
as a hierarchical construct where the interchange of materials takes place. This 
process of material interchange can be interpreted as a computing process [4]. 
Taking into account this point of view, we can say that membrane computing 
is a general architecture of “living organisms” , in the sense of Artificial Life [5] . 
Computing with membranes uses a hierarchical membrane structure in the same 
sense that it is used in the chemical abstract machine of Berry and Boudol [3], 
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and the basic ideas of evolution rules as multisets transformation are inspired 
in r systems introduced by Banatre [1]. Here we present the classical general 
description of transition P systems for determining their main components. The 
transition P systems structure is recursively defined, as composed by several 
membranes enclosed by a unique membrane named “skin” . This structure de- 
fines a hierarchical compartmental structure that can be easily represented as a 
Venn diagram. The membrane structure is the main component of a transition 
P system. Membranes are uniquely labelled and they delimit regions or vesicles 
where there exist objects, represented by a multiset, and a set of evolution rules. 
The objects are represented by symbols from a given alphabet. Usually, an evo- 
lution rule from a determined region has the form r : ca ^ cbm^da^^^ and it 
says that a copy of a together with a copy of the catalyst c (catalysts are objects 
which are never modified but they must be present to apply the rule) is replaced 
by a copy of the object d which remains in the membrane, while two copies of 
the object a are sent out through the membrane of the region, and a copy of the 
object b is sent to the region labelled by j. Note that the rule r can be applied 
if and only if the region includes directly the membrane j. In general, the use 
of the expression “a copy of the object” is reduced to “the object” in order to 
make descriptions of computations in membrane systems simpler. However, it is 
important to bear in mind that copies of objects are used when evolution rules 
are applied in regions. 

P systems evolution is performed in a non-deterministic and massively par- 
allel manner. That is, inside every region of the P system, the objects to evolve 
and the evolution rules that can be applied are chosen in an exhaustive and 
non-determinist way. Evolution in membranes is synchronous, i.e., evolution in- 
side regions is governed by an external clock. In each time unit, every region 
evolves and changes its multiset of objects by application of evolution rules of 
the region. This means that in each time unit the P system changes its state or 
configuration. 

Computations in P systems are achieved by sequences of transitions between 
configurations. A configuration is halting if no rule is applicable in any region. 
We say that a computation is successful if we reach a halting configuration, and 
the result of the computation is the number of objects sent to the environment 
through the skin or to a given elementary membrane during the computation. 

To this basic model many modifications can be added. Here we consider only 
one more feature: a priority relation among rules in regions. This new feature 
means that only rules with the higher priority can be used in a given evolution 
step. This feature permits to establish an order relationship among rules and in 
some sense to project a kind of execution program in the evolution rules set. 



2 Static Structure of Transition P Systems 

In this section we present the simplest variant of P systems, transition P systems. 
It is the basic model with the added feature of a priority relation on the set of 
evolution rules. 
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We introduce the main components of transition P systems in order to define 
them in a constructive manner and then to define the appropriate data structures 
for representing transition P systems on a functional programming language. 



2.1 Membrane Structures 

As we have said above, the membrane structure is the main component of a P 
system and it can be pictorially represented by a Venn diagram; it can also be 
mathematically represented by a tree or by a string of matching parentheses. In 
order to define appropriately strings that represent membrane structurse let us 
consider the language MS over the alphabet {[,]}, whose strings are recurrently 
defined as follows: 

- []&MS, 

- if /ii, • • • , G MS, n > 1, then [/ii • • • /r„] G MS, 

- nothing else is in MS. 

Over MS we define the equivalence relation ~ as follows. 

Let X, y G MS; then x ~ j/ if and only if x = /ii/X2/r3/i4 and y = yiy,3y,2yi, 
where /Xi, /X 2 , ^ 3, ^4 G MS. Let us denote MS/^ = MS. 

The elements oi MS are called membrane structures. Each matching pair 
of parentheses [, ] of a membrane structure is called a membrane. The external 
membrane of a membrane structure is called skin. A membrane without internal 
membranes (i.e., having the form []) is called an elementary membrane. 

The number of membranes in a membrane structure /i G MS' is called the 
degree of the membrane structure, it is represented by deg{y), and it is recursively 
defined by: 

~ <i^9{[]) = 1 . 

- deg{[yi ■ ■ ■ ^„]) = 1 + X]r=i deg{y,i). 

The number of “levels” of a membrane structure y G MS, the depth of y, 
can be recursively defined as follows: 

- dep{[]) = 1, 

- dep{[yi ■ ■ ■ yn]) = 1 + max{dep{yi), ■■ ■ , dep(yn)}. 

As it was stated above, membranes delimit regions. In a membrane structure 
y, the number of regions is equal to deg{y). The Venn diagram representation 
of a membrane structure helps to clarify the notion of a region. A region is any 
closed space delimited by membranes in a membrane structure. It is also clear 
that a membrane structure of degree n contains n regions, one associated with 
each membrane. Moreover, we will say that two regions are adjacent if and only 
if there is only one membrane between them. This fact is important because 
communication between two regions is possible only if they are adjacent. 
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2.2 Multisets 

Regions defined by membranes are vesicles able to contain copies of objects from 
a given alphabet. Therefore, a natural way to represent objects inside regions is 
by using multisets. A compact representation of a multiset is by means of words 
over a given alphabet. 

A multiset over a given object set U can be formally defined as follows: 

Let U be an arbitrary set and let N be the natural number set: a multiset 
M over U is a mapping from U to N, M : U ^ N, which we will use to write 
in the form a Ma. 

Every mapping can be also represented in form of a set of coordinated pairs: 
M = {(a,Ma) \ a G U}. 

Multisets can be also represented by a polynomial with exponents in the 
natural number set; this third representation is more compact and natural, and 
it is used in most of the papers about P systems; the polynomial representation 
is more intuitive and it expresses better than the others the sense of number of 
copies of objects from the alphabet in the region: 

• • • z”" = {(a, Ua), (&, n-h), • • • , (z, n^)}. 

It is important to know which objects are really present in the region with 
at least one copy. That subset of a multiset is called the support of the multiset: 
Supp M = {a G U \ Ma > 0}. 

Moreover, we distinguish the Empty Multiset represented by 0, and defined 
as 0 : [/ ^ N with a Oa. 

Several operations on multisets can be defined. Let M, Mi, and M2 be mul- 
tisets over U. Then: 

Multiset Inclusion: Mi C M2 if and only if Vo G U, (Mi a) < (M2 a). 

Multiset Union: Va G U, (Mi U M2) a = (Mi a) + (M2 a). 

Multiset Difference: Va G U, (Mi — M2) a = (Mi a) — (M2 a), if Mi C M2. 



2.3 Super-cell 

The super-cell concept provides the possibility of adding objects to a membrane 
structure. This fact transforms the membrane structure in a computing device 

[4]. 

Let [/ be a denumerable set of objects. Let p, be a membrane structure with 
its membranes injectively labelled with natural numbers from 1 to deg{p). The 
regions associated with membranes are labelled with the same label as their 
external membranes. 

A partial order relation can be defined over the label set of p LABELS = 
{1, • • • , deg{p)} as follows: Let i,j G LABELS, then we can say that i <fj, j iff 
region i contains region j directly or i contains a region k which contains j. 
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We define now the concept of adjacent regions. Let i,j € {1, • • • , de(/(/x)} be 
labels for two regions of /r. We say that region i is adjacent to region j iff: 

{i <f, jA G {!,• • ■ ,deg{fx)} \ i k A k <f, j) V 

{j iA ^k G {!,• • ■ ,deg{fi)} | j fc A fc i). 

A super-cell is obtained from a membrane structure fx by associating with 
each region of the membrane structure /r a multiset over U in such a manner 
that for i varying from 1 to n = deg{ii), the region i has associated the multiset 
Mi : [/ ^ N. The multiset Mi associated with the region i is said to be the 
contents of region i. 

2.4 Evolution Rules 

Evolution rules make evolve the region with which they are associated, changing 
the region contents and sending objects to other regions; they can also make 
disappear the external membrane of the region. An evolution rule is formed by 
one antecedent and by one consequent. Both of them can be represented by 
multisets over different sets. An evolution rule can be defined as follows. 

Let L be a label set, let U be an object set, and let D = {out, here} U {in j \ 
j G L}. An evolution rule is a triple (u,v,S), where t6 is a multiset over U, v is 
a multiset over U x D and S G {dissolve, not dissolve}. 

We can define several operations on evolution rules. 



Definitions: Let r = {u,v,S),ri = (ui, ui, 5i), and r 2 = (u 2 , U 2 , <52) be evolution 
rules with labels in L and objects in U. Let a € U and n be natural number. 

Addition of Evolution rules: ri -|- r 2 = (ui U U2,vi U V2,di V d2). 

Product of an evolution rule by a natural number: n r = 

Inclusion of evolution rules: ri C r2 if and only if ui C U2- 

Input of an evolution rule: Input r = u. 

Dissolve: Dissolve r = 6 . 

Also, it is necessary to define some functions over evolution rules related to 
the rule consequent. These functions will provide important information about 
what happens when the rule is applied. 

The set of membranes where the rule is sending objects is 

Ins r = {j G L I (_, in j) G Supp u}. 

Evolution rule’s outputs: 

{OutputToOut r)a = v{a,out), 

{OutputToH ere r)a = v(a, here), 

{{OutputToIn j)r)a = v{a, in j). 

Functions OutputToOut, OutputToH ere, OutputToIn, return the multiset 
that the rule is sending to his father (ToOut), to itself (ToHere) and to a 
determined region (Toln- ■ ■). These functions will be very useful in order to 
define the changes produced during the evolution of a transition P system. 




Fernando Arroyo et al. 



2.5 Regions 

A region is an inter-membranes space; the regions form compartments which 
contain objects and a set of evolution rules able to make evolve the region. 
Therefore, a region is the area delimited by a membrane r and any membrane 
directly inside r. Regions in the membrane structure of a transition P system are 
uniquely labelled by elements from some set L, contains objects and a partially 
ordered set of evolution rules. In order to define more precisely a region, let L be 
a label set and U be an object set. A region with a label in L and objects in U is 
a triple (/,w, {R, p)) where / G L, w is a multiset over U, R is a, set of evolution 
rules with labels in L and objects in U and p is a partial order relation over R. 



2.6 Transition P Systems 

In order to provide a formal definition of transition P systems [4], it will be 
necessary to take into account the previous given definitions: super-cell and 
evolution rules. 

Let U he a set (alphabet) whose elements are named objects. A transition P 
system over U is a construct 



R {Pi ; * * * j ^deg{fi) ? (-^1 5 ) 7 ‘ ‘ ‘ ? (^Rdeg(p) 1 Pdeg{p) ) ) ^0)5 

where: 

i. /i is a membrane structure with membranes labeled in a one-to-one manner 
with natural numbers from 1 to deg{p). 

ii. uji, 1 < i < deg{p), is a multiset (word) over U associated with region i. 

iii. Ri, 1 < i < deg{p), is a finite set of evolution rules associated with region i. 

iv. Pi, 1 < i < deg{pi), is a partial order over Ri. 

V. 1 < to ^ deg{p), is the label that identifies the output membrane of the 
system. 

It is possible to distinguish two different parts in a transition P system: 
- ■ ■ ,0Jdeg(p)) defines a super-cell, it is the static part of the system, while 
the second part, {{Ri, pi), ■ ■ ■ , (Rdeg{g.), Pdeg(p)),io), is the dynamic component 
of the P system, and provides the evolving capacity to it. 

This definition corresponds to a transition P system with a membrane output, 
but it is possible to define different variants, for instance, without an output 
membrane, or with multiples output membranes. 



3 Data Structures for Representing Transition P Systems 

Until now, we have presented all the components of transition P systems de- 
scribed in terms of algebraic structures, with their corresponding operations 
and functions [2]. Now it is necessary to translate such algebraic structures into 
data structures. In order to do this, we have several possibilities, because: 




Structures and Bio-language to Simulate Transition P Systems 



7 



i. The same data structure with different operations on it can represent differ- 
ent algebraic structures. 

ii. The same algebraic structure can be represented by different combinations 
of data structures with operations. 

Therefore, the main difference between two data structures and their asso- 
ciated operations representing the same algebraic structure will be efficiency in 
the computing time and allocated memory. To represent an algebraic structure 
into a data structure the best choice will be the one that allows the highest 
efficiency maintaining the algebraic structure original properties. 

From a computational point of view, the algebraic structure used for defining 
transition P systems has one essential characteristic: regions and their contents 
are independents, but they are linked by labels. So, these facts make easy to 
design a structure of references (tree, graph, etc.) that automatically build a 
structure for referred items. 

These kinds of structures are called independent references. In what follows, 
we present a data structure which does not need to refer to regions in order to 
define its structure, but it builds directly a region structure. It is called a regions 
tree. 

A tree, algebraically speaking, can be represented by multiple data structures. 
Among them: 

i. An array or vector of pairs of labels in the form (father’s label, son’s label). 

ii. A list of pairs of labels in the form (father’s label, son’s label). 

iii. A square matrix whose rows and columns are marked will be the labels. 

However, the variants which permit a more efficient treatment due to a min- 
imal unnecessary search are the following: 

i. An array or vector of pairs in the form (father’s label, list of sons’ labels) 
with labels as indices. The main advantage of this structure is that it permits 
a direct access from every region to its father’s region, and this is due to 
the fact that it represents a bi-directional tree. The structure has a certain 
redundancy degree, the same label can appear as a father index of some node 
and as the son index of another node. 

ii. A general non-empty tree whose elements are labels. In this case, labels 
appear only once in the structure. This structure is more similar in form to 
the original definition of transition P systems. 

Definitions: A tree over an arbitrary set S' is a pair where the first compo- 
nent is an element of S and the second one is a set of trees over S. 

In this definition, the second element of the pair can be the empty set, a case 
when the element is a leaf. 

Over trees it can be defined the multiplicity function which provides the 
number of occurrences of a given element a € U in the tree. 

Multiplicity: Let T = {u, S) be a tree, where u G U and S is a set of trees 
over U . Then 



{Mult T)a 



\ + ^g^g{Mult s)a ifo = u, 
ifayfu. 
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The membrane structure of a transition P system defines a tree whose root 
is the “skin” and elementary membranes are the leaves. 

Finally, for defining a transition P system, it is necessary to provide a content 
to membranes. With all given definitions, a transition P system can be defined 
as a tree of regions uniquely labelled. In a formal way, we have: 

Let L be a label set, let C/ be a set of objects. A transition P system with 
labels in L and objects in {7 is a tree II of regions with labels in L and objects 
in U provided that: 

VI G L,{Multn){l,_,_) < 2. 

An alternative definition can be given: A transition P system with labels in 
L and objects in 17 is a pair whose first element is a region with labels in L and 
objects in U and the second one is a set of transition P systems with labels in 
L and objects in U, whose regions are uniquely labelled: 

n = {{l,u,{R,p)),nn) andVI G L,{Multn){l,., _) < 2, 

where w is a multiset over U, R is a set of evolution rules over L and 17, p is a 
partial order relationship over R and nil is a set of transition P systems with 
labels in L and objects in U. 

This algebraic representation define precisely the static structure of transition 
P systems. We have named it static structure because this representation do 
not take into account the evolution in transition P systems, it only represents 
transition P systems in a static manner. 

4 Transition P Systems: The Dynamic Strnctnre 

We call dynamic structure of transition P systems everything which is related 
to evolution. During the evolution of a transition P system there are produced 
many changes. For instance, the static structure may be changed because of 
the execution of some evolution rules that dissolve membranes; moreover, the 
contents of the regions are changed by the parallel application of evolution rules. 
If a rule dissolves the external membrane of a region, then the region disappears 
from the static structure of the P system, the region sends its objects to its 
father region, and its evolution rules disappear from the system. 

The dynamics of a transition P system has been described in terms of parallel 
and non-deterministic events: “evolution is done in parallel for all objects able to 
evolve” , and “all these operations are done in parallel, for all possible applicable 
rule u V, for all occurrences of multiset u in the region associated with the 
rules, for all regions at the same time” [4]. 

During evolution in P systems, it is possible to distinguish between at least 
two different types of parallelism. The first one is the regional parallelism and 
it is related to what is happening inside a region. The second one is the system 
global parallelism and it is related to what is happening in every region of the 
P system at the same time. Both of them are involved in the system evolution. 
They are interrelated because regions may disappear from the static structure 
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of the P system and evolution rules send objects to other regions; but in some 
way, they can be independently described. 



4.1 Regional Parallelism 

This point is related to the need to characterize the parallel application of evo- 
lution rules. We will describe the dynamic evolution in a given region of a P 
system in a determined step of its evolution. The aim is to determine a set of 
evolution rules able to make evolve the region by the application of one of these 
rules. 

A region has been defined as a triple: 

Region : {label, multiset, {Rules, Partial Order Relation)) = {I , uj , {R, p)) . 

The evolution inside the region involves objects consumption by rules, send- 
ing objects to other regions and, possibly, membrane dissolution, which conse- 
quently implies that the region vanishes. All these operations are done in parallel 
with only one constraint, that a rule with a higher precedence inhibits rules of a 
lower precedence. Therefore, only rules with a higher precedence can be applied 
in order to make evolve regions. 

If we want to define exactly the rules able to make evolve a region, then we 
need the following definitions: 

Maximal Set: Let {U, <) be a partial order over U; then 
{{Maximal U) <) = {u £ U \ G U,u < v} = {u \ {u G U) A {^{u < _))}. 

Useful Rules: An evolution rule in a rules set is useful over a given labels 
set L if and only if the set Ins r is included in L. In a more formal way, let R 
be an evolution rule set with labels in L and objects in U. Let L' C L. Then 

{U seful R)L' = {r G i? I Ins r C L'} = {r | (r G i?) A {Ins r C L')}. 

By the definition of a transition P system, the objects can pass through only 
one membrane. Therefore, a rule can send objects to regions separated by only 
one membrane. In order to calculate the rules useful in a region, let L' be the 
set of labels corresponding to adjacent regions to the given region. 

Applicable Rules: A rule is applicable to a multiset co if and only if its 
antecedent is included in uj. Let i? be a set of evolution rules with labels in L 
and objects in U. Let w be a multiset over U. Then 

{Applicable R)uj = {r G R \ Input r C„ w} = {r | (r G i?) A {Inpmt r C„ ic)}. 

Therefore, a rule is applicable in a region if and only if the rule antecedent is 
included in the multiset contained by that region. In addition, in a determined 
evolution step of a transition P system, applicable rules are the only ones that 
can be used. 
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Active Rules: A rule r G Ris active with respect to a partial order relation 
p defined on R if and only if there is no rule in R higher than r according p. 
Therefore, actives rules are characterized as follows. 

Let p be a partial order relation defined over a set R of evolution rules with 
labels in L and objects in U] then: 

{Active R)p = {Maximal R)p. 

Adjacent regions to a region: Let reg be a region of a transition P system 
n with labels in L and objects in U. Let {reg,nil) be the sub-tree of II with 
the root reg. We define the adjacent region in II as: 

ADJACENTn reg = {r \ {r, 7T77') G 7T7T} = {r | (r, _) G iTTT}. 

Rules able to make evolve the region: With these definitions, it is pos- 
sible to determine exactly the set of evolution rules that can be applied in order 
to make evolve the region reg. By definition, they are those rules satisfying the 
following equations: 



L' = {I gL \ {I, -) G ADJACENTn reg} 

Active{Applicable{{U seful R)L')uS)p. (1) 

The evolution of a transition P system is described inside a region as follows: 
every rule satisfying (1) can be applied in parallel for all occurrences of rule 
antecedent multiset, for all regions at the same time. Therefore, we must search 
for a linear combination of rules satisfying (1) being complete; that is, no further 
rule satisfying (1) can be added to the linear combination. Let us explain this 
last sentence. 

The application of two rules ri , r 2 G i? at the same evolution step in a region, 
in parallel, is equivalent to applying the rule ri-|-r 2 . In this sense, we can say that 
to apply n times the same rule in parallel has the same effect as applying the rule 
defined by nr. Therefore, the parallel application of evolution rules inside a region 
can be substituted by the application of one linear combination of evolution rules 
of the region. What is necessary is that such a linear combination is maximal 
in covering the multiset of objects in the region, i.e., it is a complete linear 
combination of evolution rules in the region. Finally, the region evolves in one 
step by the application of only one rule formed by a complete linear combination 
of evolution rules in R satisfying (1). 

Any linear combination of evolution rules can be represented as a multiset 
over R, the set of evolution rules of the region, as follows. Let R= {ri, • • • , r„} be 
the set of evolution rules of region reg. Then any linear combination of evolution 
rules in R and scalars in N can be represented by a multiset over R as follows: 
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4.2 Characterization of Complete Multisets of Evolution Rules 
in a Region 

Once it is known how can be possible to make evolve a region in one step, we 
need to characterize multisets of complete evolution rules. Until now, we have 
provided an exact definition of rules that can be applied to evolve a region of 
a transition P system in one evolution step. Moreover, the parallel execution of 
rules inside the region can be replaced by the execution of only one rule. Those 
rules are a linear combination of active rules of the region expressed in terms of 
rules from R - the set of evolution rules of the region. What follows are several 
definitions over the multiset of evolution rules aiming to obtain an expression 
for identifying complete multisets of evolution rules in the given region. 



Definitions: Let i? be a set of evolution rules with labels in L and objects in 
U. Let MR, MRi, and MR 2 be multisets over R and let w be a multiset over U. 

MultiAdd: This operation performs the transformation of a multiset of 
evolution rules into a linear combination of evolution rules. From now on, we 
will keep in mind this equivalence between linear combinations of evolution rules 
and multisets of evolution rules. 

®MR= J2iMRr)r. 

r^R 

Multiinclusion: This binary relation defines a partial order relation over 
the multiset of evolution rules over R. It will be very useful in order to determine 
which multiset over R will be complete. 

MRi MR 2 = (©Mi?i) Cu (©Mi?2). 

MultiApplicable: This function defines the set of multisets over R (the 
evolution rules set) that can be applied to uj. 

{MultiApplicable R)uj = {MR \ {MR : R N) A {{Input (©Mi?)) C ru)}. 

MultiComplete: 

{Multicomplete R)uj = {Maximal{{Multi Applicable R)oj)) ©u . 

The above expression describes the set of complete multisets over (i?, C„) 
and UJ. 

Finally, let iT be a transition P system with labels in L and objects in U, let 
reg = {l,co, {R, p)) be one region of il and let {reg,IIII) be the sub-tree of il 
whose root is reg. The set of complete multisets of evolution rules is defined by 
the following equations: 



LABELS = {/ G L|((;, _,_),_) G TTiT}, 

ACTIVES = Active{Applicable{{Lf seful R)LABELS)uj)p, 
COMPLETES = {Multicomplete ACTIVES)oj. 
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The LABELS set defines the set of labels of regions adjacent to region reg. 
The ACT IV ES set defines the set of evolution rules that can be applied in order 
to make the region evolve. The COMPLETES set defines the set of complete 
multisets over R (the set of evolution rules of the region) capable of making 
evolve the region in one step. Therefore, there are as many different possibilities 
for the region evolution as many elements COMPLETES has. 



4.3 Regional Non-determinism and Global Non-determinism 

The above section has given a procedure for characterizing the set of evolution 
rules multiset over R (the set of evolution rules of a given region) such that 
their transformation into linear combinations by application of the Multi Add 
function is able to make evolve the region in one step. It has been also stated 
that there are many possibilities for making evolve the region. The number of 
different possibilities is equal to the cardinal of COMPLETES set. Therefore, a 
non-deterministic selection of a multiset in COMPLETES is enough for having 
a non-deterministic evolution in the region. 

The global non-determinism in the system is achieved through the regional 
local non-determinism. P systems are devices with a high degree of parallelism. 
Evolution is done in parallel in every region of the system. When regions ex- 
ecute the selected linear combination of evolution rules, according to a non- 
deterministic selection of a complete multiset of rules belonging to the region 
and evolve to a new region configuration, then a new configuration for the P 
system is achieved and the step of evolution is finished. We have to be aware 
that P systems are synchronous devices, i.e., in order to get a new configuration 
for the system, all the regions must finish their evolution and only when the new 
system configuration is reached a new evolution step may start. 



5 The Software System 

Based on this formalism, we are developing a set of software modules in order to 
implement transition P systems on digital computers. The software system has 
two different kinds of modules, the first one related to algebraic structures, and 
the second one related to the transition P systems architecture. 

Modules included in the algebraic structures are tuples, sets, multisets, and 
relations. In the P system architecture we can find modules of directions, rules, 
regions, and transition P system. The functional dependencies among modules 
are shown in Figure 1. 

6 Bio-language for Representing Transition P Systems 

Programming a transition P system is not writing a code in a determined pro- 
gramming language, but developing a transition P system architecture (mem- 
brane structure) and projecting on it data (an object multiset) and operations 
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Fig. 1. Functional dependencies among modules of the software system 



- evolution rules with a priority relation among them. Then, the code execution 
is obtained making evolve the P system, and results are obtained by examina- 
tion of a given region in the system. Of course, the code execution is made in 
a non-deterministic and massively parallel way. However, it will be very useful 
if P systems designers could have a special language for developing P systems 
architectures. Such a language must express the special characteristics of P sys- 
tems and, through a process of compilation, should translate the architecture to 
the needed data structures for simulating the execution on a digital computer 
and launch the execution of the code. This is the usual process of compilation 
in programming languages (Figure 2). 

The process of compilation assumes the task of main function or launcher of 
the distributed processed embedded in evolution rules of P systems regions. 

Now we will describe the syntax of our bio-language by rewriting rules: 

— R a set of objects, 

— La set of labels, 

— Transition P system (TPS): IT [i Region; {n}]i, 

— Region Objects, Rules, Priorities, 

— Objects {o"}, where o G V,n G N, 

— Rules X\rule{,rule}, 

— Priorities A|ri < rj{,r^ < r'}, where G N, 

— Addresses here\out\l where I G L, 

— ObjTarg {{o, Addresses)^}, where o G R and n 1|2| • • •, 

— Rule rm : Objects ObjTarg S, where <5 G {dissolve, not dissolve}. 

These rewriting rules define the syntax of the TPS bio-language for deter- 
mining the static structure of a transition P system at any step of the evolution. 
The internal code execution is embedded in the transition P system architecture 
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Fig. 2. Compilation and decompilation processes during transition P system evolution 



through the evolution rules of the transition P system rules, which define the 
internal distributed functions of the P system. We have named bio-language this 
language because sentences are much closed to the particular components of P 
systems (membranes, regions, objects and evolution rules), and such systems 
have a biological inspiration. 

7 Conclusions 

This work deals with the formalisation problem of the static and dynamic struc- 
tures of transition P systems in terms of computational data structures. It has 
been also needed to solve the non-deterministic and parallel execution of evolu- 
tion rules in regions. A first approach to a bio-language for transition P systems 
representation has been presented. Sucha bio-language will facilitate the task 
of developing programs, representing P systems and to execute them on digital 
computers. 

We believe that simulations of some models of membrane computing may help 
to spread these new computational paradigms (membrane computing) outside 
the actual group of researchers and to present them to the computer science 
community as feasible models for solving NP-complete problems even in linear 
time. 
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Abstract. Gamma was originally proposed in 1986 as a formalism for 
the definition of programs without artihcial sequentiality. The basic idea 
underlying the formalism is to describe computation as a form of chemical 
reaction on a collection of individual pieces of data. Due to the very 
minimal nature of the language, and its absence of sequential bias, it 
has been possible to exploit this initial paradigm in various directions. 
This paper reviews most of the work around Gamma considered as a 
programming or as a specification language. A special emphasis is placed 
on unexpected applications of the chemical reaction model, showing that 
this paradigm has been a source of inspiration in various research areas. 



1 The Basic Chemical Reaction Model 

The notion of sequential computation has played a central role in the design of 
most programming languages in the past. This state of affairs was justified by 
at least two reasons: 

— Sequential models of execution provide a good form of abstraction of algo- 
rithms, matching the intuitive perception of a program defined as a “recipe” 
for preparing the desired result. 

— Actual implementations of programs were made on single processor archi- 
tectures, reflecting this abstract sequential view. 

However the computer science landscape has evolved considerably since then. 
Sequentiality should no longer be seen as the prime programming paradigm but 
just as one of the possible forms of cooperation between individual entities. 

* This paper is a revised version of Gamma and the chemical reaction model: ten years 
after [10]. It has been reorganized and includes additional sections on applications of 
the chemical reaction model. Sections presenting large examples, extensions of the 
formalism and implementations issues have been seriously shortened. The reader is 
referred to [10] and the original papers for further details on these topics. 



C.S. Calude et al. (Eds.): Multiset Processing, LNCS 2235, pp. 17—44, 2001. 
(c) Springer- Verlag Berlin Heidelberg 2001 
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The Gamma formalism was proposed fifteen years ago to capture the intuition 
of computation as the global evolution of a collection of atomic values interacting 
freely. Gamma is a kernel language which can be introduced intuitively through 
the chemical reaction metaphor. The unique data structure in Gamma is the 
multiset which can be seen as a chemical solution. A simple program is a pair 
{Reaction condition. Action). Execution proceeds by replacing in the multiset 
elements satisfying the reaction condition by the products of the action. The 
result is obtained when a stable state is reached, that is to say when no more 
reactions can take place. The following is an example of a Gamma program 
computing the maximum element of a non-empty set. 

max : x, y ^ y x < y 

The side condition x < y specifies a property to be satisfied by the selected ele- 
ments X and y. These elements are replaced in the set by the value y. Nothing is 
said in this definition about the order of evaluation of the comparisons. If several 
disjoint pairs of elements satisfy the condition, the reactions can be performed in 
parallel. Let us consider as another introductory example a sorting program. We 
represent a sequence as a set of pairs {index, value) and the program exchanges 
ill-ordered values until a stable state is reached and all values are well-ordered. 

sort : {i,x), {j,y) ^ {i,y), {j,x) ^ {i > j) and {x < y) 

The possibility of getting rid of artificial sequentiality in Gamma confers a very 
high level nature to the language and allows the programmer to describe pro- 
grams in a very abstract way. In some sense, one can say that it is possible in 
Gamma to express the very “idea” of an algorithm without any unnecessary 
linguistic idiosyncrasy (like “exchange any ill-ordered values until all values are 
well ordered” for the sorting algorithm) . This also makes Gamma suitable as an 
intermediate language in the program derivation process: Gamma programs are 
easier to prove correct with respect to a specification and they can be refined for 
the sake of efficiency in a second stage. This refinement may involve the intro- 
duction of extra sequentiality but the crucial methodological advantage of the 
approach is that logical issues can be decoupled from implementation issues. 

To conclude this introduction, let us quote E.W.Dijkstra thirty years ago 
[27]: “Another lesson we should have learned from the recent past is that the de- 
velopment of “richer” or “more powerful” programming languages was a mistake 
in the sense that these baroque monstrosities, these conglomerations of idiosyn- 
crasies, are really unmanageable, both mechanically and mentally. I see a great 
future for very systematic and very modest programming languages” . We believe 
that this statement is more relevant than ever and the very minimal nature of 
the original Gamma formalism is one factor which has made possible the various 
developments that are sketched in this paper. 

We start by providing in section 2 the basic intuitions about the program- 
ming style entailed by Gamma as well as various programming examples and 
extensions. 
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Gamma was originally proposed in the context of a work on systematic pro- 
gram derivation [8] . Section 3 describes how Gamma can be used as an interme- 
diate language in the derivation of efficient implementations from specifications. 

Since the initial developments, Gamma has been a source of inspiration in 
unexpected research areas. Section 4 shows several applications of the chemi- 
cal reaction model in various domains such as the semantics of process calculi, 
imperative programming and software architectures. Section 5 concludes by a 
sketch of related work. 

2 Gamma as a Programming Language 

Using the chemical reaction model as a basic paradigm can have a deep effect on 
our way of thinking about algorithms. We first try to convey the programming 
style favored by Gamma through some very simple examples. Then we intro- 
duce five basic programming schemes, called “tropes” which have emerged from 
our experience in writing Gamma programs. We proceed by presenting large 
applications written in Gamma and reviewing various linguistic extensions. 



2.1 A New Programming Style 

We first come back on the straightforward max program defined in the intro- 
duction to illustrate some distinguishing features of Gamma: 

max : x, y ^ y x < y 

In order to write a program computing the maximum of a set of values in a 
“traditional” language, we would first have to choose a representation for the 
set. This representation could typically be an array for an imperative language 
or a list for a declarative language. The program would be defined as an iter- 
ation through the array, or a recursive walk through the list. The important 
point is that the data structure would impose constraints on the order in which 
elements are accessed. Of course, parallel versions of imperative or functional 
programs can be defined (solutions based on the “divide and conquer” paradigm 
for example), but none of them can really model the total absence of ordering 
between elements that is achieved by the Gamma program. The essential feature 
of the Gamma programming style is that a data structure is no longer seen as 
a hierarchy that has to be walked through or decomposed by the program in 
order to extract atomic values. Atomic values are gathered into one single bag 
and the computation is the result of their individual interactions. A related no- 
tion is the “locality principle” in Gamma: individual values may react together 
and produce new values in a completely independent way. As a consequence, a 
reaction condition cannot include any global condition on the multiset such as V- 
properties or properties on the cardinality of the multiset. The locality principle 
is crucial because it makes it easier to reason about programs and it encapsulates 
the intuition that there is no hidden control constraints in Gamma programs. 
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Let us now consider the problem of computing the prime numbers less than 
a given value n. The basic idea of the algorithm can be described as follows: 
“start with the set of values from 2 to n and remove from this set any element 
which is the multiple of another element” . So the Gamma program is built as the 
sequential composition of iota which computes the set of values from 2 to n and 
rem which removes multiples. The program iota itself is made of two reactions: 
the first one splits an interval {x, y) with x yf j/ in two parts and the second one 
replaces any interval (x, x) by the value x. 

primes{n) = rem{iota{{{2,n)})) 

iota = (x,y) ^ lx,[{x + y)/2]), {[{x + y)/2] + l,y) ^ x y 

(x,y) ^ X ^ X = y 

rem = x, y ^ y ^ multiple{x, y) 

The first reaction increases the size of the multiset, the second one keeps it 
constant and the third one makes the multiset shrink. In contrast with the usual 
sequential or parallel solutions to this problem (usually based on the successive 
application of sieves [9]), the Gamma program proceeds through a collection of 
atomic actions applying on individual and independent pieces of data. 

Another program exhibiting these expansion and shrinking phases is the 
Gamma version of the Fibonacci function: 

fib{n) = add{deci{n)) 
deci = X ^ X — 1, X — 2<J=x>l 
X ^ 1 X = 0 

add = X, y^x + 7/<J= True 

The initial value is decomposed by deci into a number of ones which are then 
summed up by add to produce the result. The first phase corresponds to the 
recursive descent in the usual functional definition 

fib{x) = if X < 1 then 1 else fib{x — 1) + fib{x — 2) 

while the reduction phase is the counterpart of the recursive ascent. However the 
Gamma program does not introduce any constraint on the way the additions are 
carried out, which contrasts with the functional version in which additions must 
be performed following the order imposed by the recursion tree of the execution. 

As a last example of this introduction, let us consider the “maximum segment 
sum” problem. The input parameter is a sequence of integers. A segment is a 
subsequence of consecutive elements and the sum of a segment is the sum of its 
values. The program returns the maximum segment sum of the initial sequence. 
The elements of the multiset are triples (i,x,s) where i is the position of value 
X in the sequence and s is the maximum sum (computed so far) of segments 
ending at position i. The s field of each triple is originally set to the x field. The 
program maxi computes local maxima and maXg returns the global maximum. 

maxss{M) = maXg{maxi{M)) 

maxi =(i,x,s), (i',x',s') ^ (i,x,s), (i',x',s + x') 

<= (i' = i + 1) and {s + x' > s') 
maXg = (i,x,s), (i',x',s') (i',x',s') ■<= s' > s 
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2.2 The Tropes: Five Basic Programming Schemes 

The reader may have noticed a number a recurrent programming patterns in 
the small examples presented in the previous section. After some experience 
in writing Gamma programs, we came to the conclusion that a very small 
number of program schemes were indeed necessary to write most applications. 
Five schemes (basic reactions), called tropes (for fransmuter, reducer, optimiser, 
expander, selector) are particularly useful. We present only three of them here: 

— Transmuter. 

T{C,f)=x^ f{x)^C{x) 

The transmuter applies the same operation to all the elements of the multiset 
until no element satisfies the condition. 

— Reducer. 

T^{C, f) = x,y ^ f{x,y) <= C{x,y) 

This trope reduces the size of the multiset by applying a function to pairs 
of elements satisfying a given condition. The counterpart of the traditional 
functional reduce operator can be obtained with an always true reaction 
condition. 

— Expander. 

£{C, /i, / 2 ) = X ^ <= C{x) 

The expander is used to decompose the elements of a multiset into a collec- 
tion of basic values. 

The Fibonacci function can be expressed as the following combination of tropes: 

fib{n) = add ( zero ( dec ({n}))) 
dec =£{C,fi,f 2 ) where 

C{x) = x > 1, fi{x) = X - I, / 2 (x) = X - 2 
zero = T (C, /) where 

C(x) = (x = 0), /(x) = 1 
add =TZ{C,f) where 

C{x,y) = True, f{x,y) = x + y 

The maximum segment sum program presented in the previous section can 
be defined in terms of tropes in a very similar way. Further details about tropes 
may be found in [35]. 

2.3 Larger Applications 

The interested reader can find in [9] a longer series of examples chosen from a 
wider range of domains (string processing problems, graph problems, geometric 
problems). We just sketch in this section a small selection of applications that we 
consider more significant, either because of their size or because of their target 
domain. 




22 



Jean-Pierre Banatre, Pascal Fradet, and Daniel Le Metayer 



Image processing application. Gamma has been used in a project aiming at 
experimenting high-level programming languages for prototyping image process- 
ing applications [24]. The application was the recognition of the tridimensional 
topography of the vascular cerebral network from two radiographies. A version 
of this application written in PL/1 was in use before the start of the experiment 
but it was getting huge and quite difficult to master. One of the benefits of 
rewriting the application in Gamma has been a better understanding of the key 
steps of the application and the discovery of a number of bugs in the original 
software. So Gamma has been used in this context as an executable specifica- 
tion language and it turned out to be very well suited to the description of this 
class of algorithms. The basic reason is probably that many treatments in image 
processing are naturally expressed as collections of local applications of specific 
rules. 

Another example of application of Gamma to image processing is reported 
in [47] . The aim of this application is to generate fractals to model the growth of 
biological objects. Again, the terseness and the facility of the expression of the 
problem is Gamma was seen as a great advantage. In both experiences however, 
the lack of efficient general purpose implementation of Gamma was mentioned 
as a serious drawback because it prevented any test on large examples. 

Reactive programming. In [58], an operating system kernel is defined in 
Gamma and proven correct in a framework inspired by the Unity logic [14]. 
An important result of this work is the definition of a temporal logic for Gamma 
(extended with a fairness assumption) and the derivation of the kernel of a file 
management system by successive refinements from a temporal logic specifica- 
tion. Each refinement step results in a greater level of detail in the definition of 
the network of processes. We are not aware of comparable attempts in the area 
of operating systems. 

2.4 Implementations 

A property of Gamma which is often presented as an advantage is its potential 
for concurrent interpretation. In principle, due to the locality property, each 
tuple of elements fulfilling the reaction condition can be handled simultaneously. 
It should be clear however that managing all this parallelism efficiently can be a 
difficult task and complex choices have to be made in order to map the chemical 
model on parallel architectures. The major problems to be solved are: 

1. The detection of the tuples which may react. 

2. The transformation of the multiset by application of reactions. 

3. The detection of the termination. 

This section sketches several attempts to provide parallel implementations for 
Gamma programs. 

Distributed memory implementations. Two protocols have been proposed 
[6,7] for the implementation of Gamma on network of communicating machines. 
They differ in the way rewritings are controlled: 
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— Centralized control. The elements of the multiset are distributed over the 
local memories of the processors. A central controller is connected to all 
the processors and monitors information exchanges. This protocol has been 
implemented on a Connexion Machine [23]. Other experiments have been 
conducted on the Maspar 1 SIMD machine: [46] describes an implementation 
of Gamma which results in a very good speed-up and a good exploitation 
of parallel resources. [45] shows how Higher-Order Gamma programs can be 
refined for an efficient execution on a parallel machine. 

— Distributed control. Information transfers are managed in a fully asynchronous 
way. The values of the multiset are spread over a chain of m processors. There 
is no central controller in the system and each processor knows only its two 
neighbors. The termination detection algorithm is fully distributed over the 
chain of processors; however, the cost of this detection can be high compared 
with the cost of the computation itself [6,7]. This solution has been imple- 
mented on an Intel iPSG2 machine [6,7] and on a Gonnexion Machine [23]. 
The results show a good exploitation of the processing power and speedup. 

Shared memory implementations. The Gamma model can also be seen as a 
shared memory model: the multiset is the unique data structure from which el- 
ements are extracted and where elements resulting from the reaction are stored. 
Shared memory multiprocessors are good candidates for parallel implementa- 
tions of Gamma. A specific software architecture has been developed in [33] in 
order to provide an efficient Gamma implementation on a Sequent multiproces- 
sor machine. Several techniques have been experimented in order to improve 
significantly the overall performances. A kernel operating system has also been 
developed in order to cope with various traditional problems and in particu- 
lar with the synchronization required by Gamma (a multiset element cannot 
participate in more than one reaction at a time) . 

Hardware implementation. The tropes defined in section 2.2 have been used 
as a basis for the design of a specialized architecture [62] . A hardware skeleton is 
associated with each trope and these skeletons are parameterized and combined 
according to the program to be implemented. A circuit can then be produced 
from a program description. The hardware platform was the PRL-DEG Perle 1 
board which is built around a large array of bit-level configurable logic cells [11]. 

2.5 Linguistic Extensions 

The Gamma programs that we have presented so far are made from a single 
block of reaction rules. In this section, we review several linguistic extensions for 
structuring programs or multisets. 

Composition operators for Gamma. For the sake of modularity, it is desir- 
able that a language offers a rich set of operators for combining programs. It is 
also fundamental that these operators enjoy a useful collection of algebraic laws 
in order to make it possible to reason about programs. Several proposals which 
have been made to extend Gamma with facilities for building complex programs 
from simple ones. 
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[ 34 ] presents of a set of operators for Gamma and studies their semantics and 
the corresponding calculus of programs. The two basic operators considered in 
this paper are the sequential composition P\ o P2 and the parallel composition 
Pi + P2- The intuition behind Pi o P2 is that the stable multiset reached after 
the execution of P2 is given as argument to Pi. On the other hand, the result 
of Pi + P2 is obtained (roughly speaking) by executing the reactions of Pi and 
P2 (in any order, possibly in parallel), terminating only when neither can pro- 
ceed further. The termination condition is particularly significant and heavily 
influences the choice of semantics for parallel composition. As an example of 
sequential composition of Gamma programs, let us consider another version of 
sort. 

sort' : match o init 

where init : {x (l,a;) 4 = integer{x)) 

match : ((i,x), (j,g) (i,x), (i+ l,g) ^ (x < g and i = j)) 

The program sort' takes a multiset of integers and returns an increasing list 
encoded as a multiset of pairs {index, value). The reaction init gives each integer 
an initial rank of one. When this has been completed, match takes any two 
elements of the same rank and increases the rank of the larger. 

The case for parallel composition is slightly more involved. In fact sort' could 
have been defined as well as: 

sort' : match + init 

because the reactions of match can be executed in parallel with the reactions 
of init (provided they apply on disjoint subsets, but this is implied by the fact 
that their respective reaction conditions are exclusive). As far as the semantics 
of parallel composition is concerned, the key point is that a synchronized ter- 
mination of Pi and P2 is required for Pi -\- P2 to terminate. It may be the case 
that, at some stage of the computation, none of the reaction conditions of, Pi 
(resp. P2) holds; but some reactions by P2 (resp. Pi) may create new values 
which will then be able to take part in reactions by Pi (resp. P2). This situation 
precisely occurs in the above example where no reaction of match can take place 
in the initial multiset; but init transforms the multiset and triggers subsequent 
reactions by match. Thus the termination condition of Pi -I- P2 indicates that 
neither Pi nor P2 can terminate unless both terminate and the composition as 
well. 

This new vision of parallel composition and its combination with the se- 
quential composition creates interesting semantical problems. [ 34 ] defines a set 
of program refinement and equivalence laws for parallel and sequential com- 
position, by considering the input-output behavior induced by an operational 
semantics. Particular attention is paid on conditions under which Pi o P2 can 
be transformed into Pi -I- P2 and vice-versa. These transformations are useful 
to improve the efficiency of a program with respect to some particular machine 
and implementation strategy. 

Several compositional semantics of the language have also been proposed 
[ 59 , 20 ]. Not all the laws established in the operational semantics remain valid 
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in these semantics; this is because they distinguish programs with identical in- 
put/output behavior but which behave differently in different contexts. It is 
shown however that most interesting properties still hold, the great advantage 
of a compositional semantics being that laws can be used in a modular way 
to prove properties of large programs. Other semantics of Gamma have been 
proposed, including [28] which defines a congruence based on transition asser- 
tions and [55] which describes the behavior of Gamma programs using Lamport’s 
Temporal Logic of Actions. 

Gomposition operators have been studied in a more general framework called 
reduction systems [60] which are sets equipped with some collection of binary 
rewrite relations. This work has led to a new graph representation of Gamma 
programs which forms a better basis for the study of compositional semantics 
and refinement laws. 

Higher-order Gamma. Another approach for introduction of composition op- 
erators in a language consists in providing a way for the programmer to define 
them as higher-order programs. This is the traditional view in the functional 
programming area and it requires to be able to manipulate programs as ordi- 
nary data. This is the approach followed in [41] which proposes a higher-order 
version of Gamma. The definition of Gamma used so far involves two different 
kinds of terms: the programs and the multisets. The multiset is the only data 
structure and programs are described as collections of pairs (Reaction Condi- 
tion, Action). The main extension of higher-order Gamma consists in unifying 
these two categories of expressions into a single notion of configuration. One 
important consequence of this approach is that active configurations may now 
occur inside multisets and reactions can take place (simultaneously) at different 
levels. Thus two conditions must be satisfied for a simple program to terminate: 
no tuple of elements satisfies the reaction condition and the multiset does not 
contain active elements. 

A configuration is denoted: 

[Prog, V ari = Multexpi , . . . , Gar„ = Multexpn]- 

It consists of a (possibly empty) program Prog and a record of named multi- 
sets V ari. A configuration with an empty program component is called passive, 
otherwise it is active. The record component of the configuration can be seen 
as the environment of the program. Each component of the environment is a 
typed multiset. Simple programs extract elements from these multisets and pro- 
duce new elements. A stable component Multexpi of a configuration C can be 
obtained as the result of C.Vari. 

The operational semantics is essentially extended with the following rules to 
capture the higher-order features: 

A ^ A' 

{A}0M^ {A'}0M 

[P, . . . Vark = Mfc, . . .] ^ [P, . . . Vark = M' , . . .] 
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The first and the second rule respectively account for the computation of active 
configurations inside multisets and for the transformation of multisets containing 
active configurations inside a configuration. Note that these rules are very similar 
to the chemical law and the membrane law of the Cham (section 4.1). 

Let us take one example to illustrate the expressive power provided by this 
extension. The application of the sequential composition operator to simple pro- 
grams can be defined in higher-order Gamma, and thus does not need to be 
included as a primitive. {P20 Pi){Mq) is defined by the following configuration: 

[Q, El = {[Pi, M = Mo]}, E2 = 0 ].E 2 

where Q = [0, M = Mi] : Pi ^ [P2, M = Mi] : P2 

Pi is a multiset containing the active configuration [Pi , M = Mq] initialy. Note 
that Q reactions only apply to passive values of Pi which means that Mi must 
be a stable state for Pi. Then the new active configuration [P2,M = Mi] is 
inserted into P2 and the computation of P2 can start. When a stable state 
is obtained, it is extracted from the top-level configuration through the access 
operation denoted by .P2. 

[41] shows how other useful combining forms can be defined in higher-order 
Gamma (including the chemical abstract machine) . It is also possible to express 
more sophisticated control strategies such as the scan vector model suitable for 
execution on fine-grained parallel machines. Another generalization of the chem- 
ical model to higher-order is presented in [21]. 

Structured Gamma. The choice of the multiset as the unique data constructor 
is central in the design of Gamma. However, this may lead to programs which are 
unnecessary complex when the programmer needs to encode specific data struc- 
tures. For example, it was necessary to resort to pairs (index, value) to represent 
sequences in the sort program. Trees or graphs have to be encoded in a similar 
way. This lack of structuring is detrimental both for reasoning about programs 
and for implementing them. The proposal made in [31] is an attempt to solve 
this problem without jeopardizing the basic qualities of the language. It would 
not be acceptable to take the usual view of recursive type definitions because 
this would lead to a recursive style of programming and ruin the fundamental 
locality principle (the data structure would then be manipulated as a whole). 

The solution proposed in [31] is based on a notion of structured multiset which 
can be seen as a set of addresses satisfying specific relations and associated with 
a value. As an example, the list [5; 2; 7] can be represented by a structured 
multiset whose set of addresses is joi, 02, 03} and associated values (written 
of) are Oi = 5, 02 = 2, 03 = ?. Let next be a binary relation and end a unary 
relation; the addresses satisfy 

next oi 02, next 02 03, end 03 

A new notion of type is introduced in order to characterize precisely the structure 
of the multiset. A type is defined in terms of a graph grammar (or rewrite rules). 
A structured multiset belongs to a type T if its underlying set of addresses 
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satisfies the invariant expressed by the grammar defining T. As an example, the 
list type can be defined by the following context-free graph grammar: 

List = L X 
L X = next x y, L y 
L X = end x 

Any multiset which can be produced by this grammar belongs the the List 
type. Reading the grammar rules from right to left gives the underlying rewrite 
system. So alternatively, any multiset which can be reduced using this rewrite 
rules to the singleton List belongs to the List type. The variables in the rules are 
instantiated with addresses in the multiset. L x can be seen as a non-terminal 
standing for a list starting at address x and end x is a one element list. A circular 
list can be defined as follows: 

Circular = L x x 
L X y = L X z, L z y 
L X y = next x y 

Note that the use of different variable names in a rule is significant: two variables 
are instantiated with the same address if and only if the variables have the same 
name. In this definition, L x y is the non terminal for a list starting at position 
X and ending at position y. 

A reaction in Structured Gamma can: 

— test and modify the relations on addresses, 

— test and modify the values associated with addresses. 

Here are some examples of programs operating on lists: 

Sort : List = next a b next a b, a := b,b := a <= a <b 

Mult : List = next a b, next b c next a c,a := a *b 

Iota : List = end a next a b, end b,b := a— l<J=a>l 

Actions are now described as assignments to given addresses. A consumed ad- 
dress which does not occur in the result of the action disappears from the multi- 
set: this is the case for b in the Mult program. On the other hand, new addresses 
can be added to the multiset with their value, like b in the Iota program. Actions 
must also state explicitly how the relations are modified. For instance, the Sort 
does not modify the next relation, but Mult shrinks the list by removing the 
intermediate element b. 

The natural question following the introduction of a new type system con- 
cerns the design of an associated type checking algorithm. In the context of 
Structured Gamma, type checking must ensure that a program maintains the 
underlying structure defined by a type. It amounts to the proof of an invariant 
property. There exists a sound checking algorithm based on the construction of 
an abstract reduction graph. For a reaction C A and type T, the algorithm 
computes the possible contexts X such that X + C reduces to {T} (i.e. belongs 
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to {T}). It is then sufficient to check that X + A (i.e. the multiset after the 
reaction) reduces to {T}. 

Structured Gamma allows the programmer to define his own types and have 
his programs checked according to the type definitions. For example, it is pos- 
sible to check that the three programs above manipulate multisets of type List: 
in other words, the list property is an invariant of the programs. Applications 
of this approach to imperative programming and the analysis of software ar- 
chitectures are described in sections 4.2 and 4.3. It is important to notice that 
this new structuring possibility is obtained without sacrificing the fundamental 
qualities of the language. Gamma programs are just particular cases of Struc- 
tured Gamma programs and Structured Gamma programs can be translated in 
a straightforward way into Gamma. 



3 Gamma as a Bridge 

between Specifications and Implementations 



In the previous section, we presented Gamma as a programming language and 
tried to convey the programming style entailed by the chemical reaction model 
through a series of examples. Gamma can be also seen as a very high-level 
language bridging the gap between specification languages and low-level (imple- 
mentation oriented) languages. 



3.1 Ftom Specifications to Gamma Programs 

We first present the techniques that can be used to prove properties of Gamma 
programs. Then, we suggest how they can be used to derive programs from 
specifications in a systematic way. 

In order to prove the correctness of a program in an imperative language, a 
common practice consists in splitting the property into two parts: the invariant 
which holds during the whole computation, and the variant which is required 
to hold only at the end of the computation. In the case of total correctness, it 
is also necessary to prove that the program must terminate. The important ob- 
servation concerning the variant property is that a Gamma program terminates 
when no more reaction can take place, which means that no tuples of elements 
satisfy the reaction condition. So the obtain the variant of the program by taking 
the negation of the reaction condition. Let us consider as an example the sort 
program introduced in the introduction. The reaction condition corresponds to 
the property: 



3{i,x) e M. 3{j,y) G M. {i > j) and (x < y) 



and its negation 



V(z, x) e M. V(j, y) G M. {i > j) ^ X > y 
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This variant is very informative indeed since it is the well-ordering property. The 
invariant of the program must ensure that the set of indexes and the multiset of 
values are constant. This can be checked by a simple inspection of the action 

0 »} 

It is easy to see that the global invariance follows from the local invariance. 
In order to prove the termination of the program, we have to provide a well- 
founded ordering (an ordering such that there is no infinite descending sequences 
of elements) and to show that the application of an action decreases the multiset 
according to this ordering. To this aim, we can resort to a result from [26] 
allowing the derivation of a well-founded ordering on multisets from a well- 
founded ordering on elements of the multiset. Let be an ordering on V and 
3> be the ordering on Multisets(V) defined in the following way: 

M M' 

3X,Y S Multisetsiy). X 0 and 

X C M and M' = (M — X) +Y and (Vy G Y. 3x € X. x >- y) 

The ordering ^ on Multisets(V) is well-founded if and only if the ordering 
on y is well-founded. This result is fortunate because the definition of ^ 
precisely mimics the behavior of Gamma (removing elements from the multiset 
and inserting new elements). The significance of this result is that it allows us 
to reduce the proof of termination, which is essentially a global property, to a 
local condition. In order to prove the termination of the sort program, we can 
use the following ordering on the elements of the multiset: 

(i,x) Q {i',x') {i > i' and x' > x) 

It is easy to see that this ordering is well-founded (the set of indexes and the 
multiset of values are finite), so the corresponding multiset ordering is also well- 
founded. We are left with the proof that for each value produced by the action, 
we can find a consumed value which is strictly greater. To prove this we observe 
that: 

(i, y) □ (j, y) and (j, x) C (j, y) 

This concludes the correctness proof of the sort program. 

Rather than proving a program a posteriori, it may be more appropriate to 
start from a specification and try to construct the program systematically. The 
derived program is then correct by construction. A method for the derivation of 
Gamma programs from specifications in first order logic is proposed in [8] . The 
basic strategy consists in splitting the specification into a conjunction of two 
properties which will play the roles of the invariant and the variant of the pro- 
gram to be derived. The invariant is chosen as the part of the specification which 
is satisfied by the input multiset (or that can be established by an initialization 
program). If the variant involves only V quantifiers than its negation yields the 
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reaction condition of the program directly. The technique for deriving the action 
consists in validating the variant locally while maintaining the invariant. These 
two constraints are very often strong enough to guide the construction of the ac- 
tion. Let us consider as an example the rem program in the definition of primes 
in section 2.1. The input multiset is {2, . . . , n} and a possible specification of the 
result M is the following: 

MC{2,...,n} (1) 

Va; G {2, . . . , n}. (Vy G {2, . . . , n}. ^multiple{x, y)) a: G M (2) 

'ix,y G M. ^multiple{x,y) (3) 

Both properties (1) and (2) are satisfied by the input multiset {2, . . . , n}, so the 
invariant is defined as / = (1) A (2) and the variant is = (3). The negation of 
the variant is 

3a;,?/ G M. multiple{x,y) 

which yields to the reaction condition multiple{x , y) . The action must satisfy 
the invariant which means that no value outside {2,...,n} can be added to 
the multiset and no value should be removed from the multiset unless it is the 
multiple of another value. On the other hand, the action should establish the 
variant locally which means that the returned values should not contain any 
multiples. So the action cannot return both x and y and it cannot remove y. 
the only possibility is to return y; this action satisfies all the conditions and the 
derived program is: 

rem = x, y ^ y ^ multiple{x, y) 

The interested reader can find a more complete treatment of several exam- 
ples in [8] . A slightly different approach is taken in [52] which introduces a very 
general form of specification and the derivation of the corresponding Gamma 
program. It is then shown that a number of classical and apparently unrelated 
problems (the knapsack, the shortest paths, the maximum segment sum and the 
longest up-sequences problems) turn out to be instances of the generic speci- 
fication. The generic derivation can then be instantiated to these applications, 
yielding the corresponding Gamma programs. 

Let us stress the pervasive influence of the locality principle (stated in the 
introduction) in the correctness proofs and the derivations. Each part of the 
correctness proof of the sort program sketched above exploits this feature by 
reducing the global reasoning (manipulation of properties of the whole multiset) 
to a local reasoning (on the elements involved in a single reaction) . 

3.2 Prom Gamma Programs to Efficient Implementations 

As mentioned earlier, the philosophy of Gamma is to introduce a clear separation 
between correctness issues and efficiency issues in program design. In particu- 
lar, Gamma can be seen as a specification language which does not introduce 
unnecessary sequentiality. As a consequence, designing a reasonably efficient im- 
plementation of the language is not straightforward. This section outlines several 




Gamma and the Chemical Reaction Model: Fifteen Years After 



31 



optimizations allowing Gamma programs to be refined into efficient, sequential 
programs. 

Consider a very simple form of Gamma program: 

f{xi, . . . , X„) R{xi, ...,Xn) 

A straightforward implementation of this program can be described by the fol- 
lowing imperative program: 

While tuples remain to be processed 

do 

choose a tuple {xi, . . . , x„) not yet processed; 
if R{xi, . . . , Xn) then 

(1) remove xi, . . . ,Xn from M 

(2) replace them by f{xi, . . . , Xn) 

end 

This very naive implementation puts forward most of the problems which have 
to be tackled in order to produce a Gamma implementation with a realistic 
complexity. The hardest problem concerns the construction of all tuples to be 
checked for reaction. A blind approach to this problem leads to an untractable 
complexity but a thorough analysis of the possible relationships between the 
elements of the multiset and the shape of the reaction condition may lead to 
improvements which highly optimize the execution and produce acceptable per- 
formances. In his thesis, G. Greveuil [22] studied several optimizations which are 
summarized here. 

One important source of inefficiency comes from useless (redundant or deemed 
to fail) checks of the reaction condition. Three optimizations can dramatically 
reduce the overhead resulting from this redundancy: 

1. Decomposition of the reaction condition: instead of considering R as 
a whole, one may decompose it as a conjunction of simpler conditions like: 

R{xi, . . . ,Xn) = Rl(xi) A R2(xi,X2) A ... A Rn{xi,X 2 , ■ ■ ■ , Xn) 

The test for condition R is done incrementally, avoiding the construction of 
tuples whose prefix does not satisfy one Ri. 

2. Detection of neighborhood relationships: the analysis of the reaction 
condition may provide information which can be used to limit the search 
space. For example, it may happen from the reaction condition that only 
adjacent values can react, or that only values possessing a common “flag” can 
be confronted. These properties can be detected at compile time and, in some 
situations (some sorting examples, pattern detection in image processing 
applications), the run-time improvement is considerable. 

3. Control of the non-determinism: The Gamma paradigm imposes no con- 
straint on the way tuples are formed. [22] shows that limiting non-determinism 
by imposing an ordering in the choice of values to be checked against the 
reaction condition can be very fruitful. 
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An interesting conclusion of the work described in [22] is that well-known 
efficient versions of sequential algorithms (shortest path for instance) can be 
“rediscovered” and justified as the result of several optimizations of a naiVe 
implementation of Gamma. It is very often the case that the most drastic op- 
timizations rely on structural properties of the values belonging to the multiset 
(neighborhood relationship, ordering in the choice of values . . . ) . 

Such properties are difficult to find automatically and the optimizations de- 
scribed above can be seen as further refinement steps rather than compilation 
techniques. Several proposals have been made to enrich Gamma with features 
which could be exploited by a compiler to reduce the overhead associated with 
the “magic stirring” process. For example, the language of schedules [17,18] pro- 
vides extra information about control in Gamma programs, and loeal linear logic 
[48] as well as Structured Gamma [31] structure the multiset. 

4 Gamma as of Source of Inspiration 

The chemical reaction model has served as the basis of a number of works in var- 
ious, often unexpected, research directions. In particular, ideas borrowed from 
Gamma have been applied to process calculi, imperative programming and soft- 
ware architectures. We describe these three developments in turn and conclude 
with a sketch of a few other proposals. Most of these works are quite significant 
and open new research directions but it is important to note that none of them 
jeopardizes the fundamental characteristics of the model which is the expres- 
sion of computation as “the global result of the successive applications of local, 
independent, atomic reactions” . 

4.1 The Chemical Abstract Machine 

The chemical abstract machine (or Cham) was proposed by Berry and Boudol 
[12] to describe the operational semantics of process calculi. The most impor- 
tant additions to Gamma are the notions of membrane and airlock mechanism. 
Membranes are used to encapsulate solutions and to force reactions to occur 
locally. In terms of multisets, a membrane can be used to introduce multiset of 
molecules inside a multiset that is to say “to transform a solution into a sin- 
gle molecule” [13]. The airlock mechanism is used to describe communications 
between an encapsulated solution and its environment. The reversible airlock 
operator < extracts a element m of a solution {m, mi, . . . , m„}: 

{m,mi, . . . ,m„} ^ {m<i {mi, . . . ,m„}} 

The new molecule can react as a whole while the sub-solution jmi, . . . , m„} is 
allowed to continue its internal reactions. So the main role of the airlock is to 
allow one molecule to be visible from outside the membrane and thus to take part 
in a reaction in the embedding solution. The need for membranes and airlocks 
emerged from the description of GGS [50] in Gham and especially the treatment 
of the restriction operation (which restricts the communication capabilities of a 
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process to labels different from a particular value a). The computation rules of 
the Cham are classified into general laws and two classes of rules: 

~ The general laws include the chemical law and the membrane law: 

S' 

S + S” ^ S' + S” 

S^ S' 

{C[S]} ^ {C[S']} 

The former shows that reactions can be performed freely within any solution, 
which captures the locality principle. The latter allows reactions to take place 
within a membrane (C'[S'] denotes any context of a solution S). 

— The first class of rules corresponds to the proper reaction rules similar to the 
rules presented so far in the paper. The definition of a specific Cham requires 
the specification of a syntax for molecules and the associated reaction rules. 
As an example, molecules can be CCS processes and the rule corresponding 
to communication in CCS would be: 

a.P , a.Q 1 -^ P , Q 

— The second kind of rules are called structural and they are reversible. They 
can be decomposed into two inverse relations ^ and ^ called respectively 
heating and cooling rules. The first ones break complex molecules into smaller 
ones, preparing them for future reactions, and the second ones rebuild heavy 
molecules from light ones. Continuing the CCS example, we have the struc- 
tural rule: 

{p\Q);^p, Q 

where | is the CCS parallel composition operator. 

The Cham was used in [12] to define the semantics of various process calculi 
(TCCS, Milner’s 7r-calculus of mobile processes) and a concurrent lambda cal- 
culus. A Cham for the call-by-need reduction strategy of A-calculus is defined in 
[13]. The Cham has inspired a number of other contributions. Let us mention 
some of them: 

— [1] uses a linear Cham to describe the operational semantics of proof expres- 
sions for the classical linear logic. 

— [51] defines an operational semantics of the 7r-calculus in a Cham style. 

~ [39] describes a graph reduction in terms of a Cham. 

— [43] applies the Cham in the context of the Facile implementation. 

The Cham approach illustrates the significance of multisets and their connec- 
tion with concurrency. The fact that multisets are inherently unordered makes 
them suitable as a basis for modeling concurrency which is an essentially asso- 
ciative and commutative notion. As stated in [12]: “In the SOS style of seman- 
tics, labeled transitions are necessary to overcome the rigidity of syntax when 
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performing communications between two syntactically distant agents. . . . On the 
contrary, in the Cham, we just make the syntactic distance vanish by putting 
molecules into contact when they want to communicate, and their communica- 
tion is direct. ” As a consequence, this makes it possible to bring the semantics 
of concurrent systems closer to the execution process of sequential languages, or 
the evaluation mechanism of functional languages [13]. 

4.2 Shape Types 

Type systems currently available for imperative languages are too weak to de- 
tect a significant class of programming errors. The main reason is that they fail 
to capture properties about the sharing which is inherent in many data struc- 
tures used in efficient imperative programs. As an illustration, it is impossible to 
express the property that a list is doubly-linked or circular in existing type sys- 
tems. The work around Structured Gamma (section 2.5) showed that many data 
structures could be described as graph grammars and manipulated by reactions. 
Furthermore, a static algorithm can be used to check that a reaction preserve 
the structure specified by the grammar. These ideas and techniques have been 
adapted in order to extend the type system of C and make pointer manipulation 
safer [30]. 

Shape-C is an extension of C which integrates the notion of types as graph 
grammars (called here shapes) and reactions. The notion of graph grammars 
is powerful enough to describe most complex data structures (see [30] for a 
description of skip lists, red-black trees, left-child-right-sibling trees in terms of 
graph grammars). 

The design of Shape-C was guided by the following criteria: 

— the extensions should be blended with other C features and be natural 
enough for C programmers, 

— the result of the translation of Shape-C into pure C should be efficient, 

— the checking algorithm of section 2.5 should be applicable to ensure shape 
invariance. 

We present Shape-C through an example: the Josephus program. This pro- 
gram, borrowed from [56] , first builds a circular list of n integers; then it proceeds 
through the list, counting through m — 1 items and deleting the next one, until 
only one is left (which points to itself). Figure 1 displays the program in Shape- 
C. The Josephus program first declares a shape cir denoting a circular list of 
integers with a pointer pt. Besides cosmetic differences, the definition of shapes 
is similar to the context free grammars presented in Section 2.5. The variables 
are interpreted as addresses. They possess a value whose type must be declared 
(here int). Values can be tested or updated but cannot refer to addresses. They 
do not have any impact on shape types. 

Intuitively, unary relations (here pt) correspond to roots whereas binary 
relations (here next) represent pointer fields. Shapes can be translated into C 
structures with a value field and as many fields (of pointer type) as the shape 
has binary relations. 
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/* Integer circular list */ 

shape int cir { pt x, L x x; 

Lxy=Lxz, Lzy; 

L X y = next x y ; } ; 

mainO 

■fint i, n, m; 

/♦initialization to a one element circular list */ 

cir s = [| => pt x; next x x; $x=l; |]; 

scan! (""/od’/,d" , &n, &m) ; 

/* Building the circular list l->2-> . . . ->n->l */ 

for (i = n; i > 1; i — ) 

s: [| pt x; next x y; => pt x; next x z; next z y; $z=i; |] ; 

/* Printing and deleting the m th element until only one is left */ 

while (s:[| pt x; next x y; x != y; => |]) 

{ 

for (i = 1; i < m-1; ++i) 

s: [| pt x; next x y; => pt y; next x y; |] ; 
s:[| pt x; next x y; next y z; => pt z; next x z; printf(""/,d I] 

} 

/* Printing the last element */ 

s:[| pt X => pt x; printf (""/od\n" ,$x) ; |]; 

> 

Fig. 1. Josephus Program 

Shape-C uses only a subset of graph grammars that corresponds to the rooted 
pointer structures manipulated in imperative languages. This subset is defined 
by the following properties: 

(51) Relations are either unary or binary. 

(52) Each unary relation is satisfied by exactly one address in the shape. 

(53) Binary relations are functions. 

(54) The whole shape can be traversed starting from its roots. 

These conditions allow shapes to be implemented by simple C structures 
(with a value and pointer fields) . They can be enforced by analyzing the definition 
of grammars. 

The reaction, written [ I C => A I ] , is the main operation on shapes. Two 
specialized versions of reactions are also provided: initializers, with only an ac- 
tion, written [ I => A I ] and tests, with only a condition, written [ I C => I ] . 

The Josephus program declares a local variable s of shape cir and initializes 
it to a one element circular list. 

cir s = [| => pt x; next x x; $x = 1 ; I] ; 

The value of address x is written $x and is initialized to 1. In general, actions 
may include arbitrary C-expressions involving values. The for- loop builds a n 
element circular list using the reaction 

s: [I pt x; next x y; => pt x; next x z; next z y; $z=i; |] ; 
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The condition selects the address x pointed to by pt and its successor. The 
action inserts a new address z and initializes it to i. The translation in pure C 
is local and applied to each shape operation of the program. Shape-C enforces 
a few simple restrictions on reactions so that the translation is both direct and 
efficient. 

Shape checking amounts to verify that initializations and reactions preserve 
the shape of objects. The checking algorithm is directly based on the algorithm 
outlined in section 2.5. Note that values and expressions on values are not rele- 
vant for shape checking purposes. Using this algorithm, it is easy to ensure that 
the list s is cyclic throughout the Josephus program. 

Due to their precise characterization of data structures, shape types are a 
very useful facility for the construction of safe programs. Most efficient versions 
of algorithms are based on complex data structures which must be maintained 
throughout the execution of the program [16,56]. The manipulation of these 
structures is an error-prone activity. Shape types permits to describe invariants 
of their representation in a natural way and have them automatically verified. 



4.3 Software Architectures 

Another related area of application which has attracted a great amount of in- 
terest during the last decade is the formal definition of software architectures. 
As stated in [2], “Software systems become more complex and the overall sys- 
tem structure - or software architecture - becomes a central design problem. An 
important step towards an engineering discipline of software is a formal basis 
for describing and analyzing these designs”. Typical examples of software ar- 
chitectures are the “client-server organization”, “layered systems”, “blackboard 
architecture”. Despite the popularity of this topic, little attention has focused 
on methods for comparing software architectures or proving that they satisfy 
certain properties. One major reason which makes these tasks difficult is the 
lack of common and formally based language for describing software architec- 
tures. These descriptions are typically expressed informally with box and lines 
drawings indicating the global organization of computational entities and the 
interactions between them [2]. The chemical reaction model has been used for 
specifying software architectures [38] and architecture styles [42]. 

Software architecture specification. The application considered in [38] is 
a multi-phase compiler and two architectures are defined using the “chemical 
abstract machine” [12,13]. The different phases of the compiler are called lexer, 
parser, semantor, optimiser and generator. An initial phase called text generates 
the source text. The types of the data elements circulating in the architecture are 
char, tok, phr, cophr, obj. The elements of the multiset have one of the following 
forms: 

ifti) o 0 (^ 2 ) o phase 
ofti) o phase o i{t 2 ) 
phase o i{ti) o o{t 2 ) 
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where o is a free constructor, ti and t2 represent data types and phase is one 
of the phases mentioned above. An element starting with i(ti) (resp. o(ti)) cor- 
responds to a phase which is consuming inputs (resp. producing outputs). An 
element starting with phase is not ready to interact. So i{ti) and o(ti) can be 
seen as ports defining the communications which can take place in a given state. 

The following is a typical reaction in the definition of an architecture: 

i{di) o 0(^2) o wi, o(di) o m2 o iid^) 

0(^2) o o i{di), m2 o iid^) o o{di) 

This rule describes pairwise communication between processing elements: mi 
consumes input di produced as output by another processing element m2. For 
example, the reaction: 

i{tok) o o{phr) o parser, o{tok) o lexer o i{char) 

o{phr) o parser o i{tok), lexer o i{char) o o{tok) 

represents the consumption by the parser of tokens produced by the lexer. At 
the end of the reaction, the parser is ready to produce its output and the lexer is 
inert because it has completed its job. In fact another reaction may be applied 
later to make it active again to process another piece of text. 

One major benefit of the approach is that it makes it possible to define 
several architectures for a given application and compare them in a formal way. 
As an example, [ 38 ] defines a correspondence between multisets generated by 
two versions of the multi-phase compiler and establishes a form of bisimulation 
between the two architectures. They also prove normalization properties of the 
programs. 

Software architecture styles. The approach described in [ 42 ] focuses on the 
interconnection between individual components of the software architecture. The 
main goal is to describe architecture styles (i.e. classes of architectures) and to 
check that the dynamic evolution of an architecture preserves the constraints 
imposed by the style. Techniques developed for Structured Gamma (graph gram- 
mars and the associated checking algorithm) can be applied to this problem. 

Structured Gamma allows connections to become “first class” objects and 
to prove invariance properties on the structure of the network. For example, a 
client-server architecture style can be defined as the graph grammar 

ClientServer = CS m 

CS m = cr c m, ca m c, C c, CS m 

CS m = sr m s, sa s m, S s, CS m 

CS m = M m, X X 

The unary relations C, S, M and X correspond respectively to client, server, 
manager and external entities. The external entity stands for the external world; 
it records requests for new clients wanting to be registered in the system. The 
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binary relations cr and ca correspond to client request links and client answer 
links respectively (sr and sa are the dual links for servers). For example, the 
architecture 





involves two clients ci and C 2 , two servers si and S 2 , a manager mo and the 
external entity xq- It belongs to the client-server class (grammar) ClientServer. 

It is often the case that the architecture of an application should be able to 
evolve dynamically. For instance, a client-server organization must allow for the 
introduction of new clients or their departure, a pipeline may grow or shrink, 
facilities for dealing with mobile computing may be required. In this framework, 
the evolution of the architecture is defined by a coordinator. The task of the 
coordinator is expressed by reaction rules. As an illustration, the following co- 
ordinator applies to a client-server architecture: 

X a;, 'M. m —> li. x' , Mm, cr c m, ca m c, C c 



cr c m, ca m c, C c — > 0 

The two rules^ describe respectively the introduction of a new client in the 
architecture and its departure. The main benefit of this approach is that the 
algorithm of section 2.5 can be used to ensure that a coordinator does not break 
the constraints of the architecture style. This makes it possible to reconcile 
a dynamic view of the architecture with the possibility of static checking. For 
example, had we forgotten, say cr c m in the right-hand side of the first rule, then 
the coordinator would have been able to transform a client-server architecture 
into an architecture which would not belong any longer to the class defined by 
ClientServer . 

The interested reader can find in [37] the treatment of an industrial case 
study proposed by the Signaal company [25] using a multiple views extension 

^ In fact, these rules are completed with side conditions on the states of the entities 
otherwise, the coordinator could add or remove entities without any consideration 
of the current state of the system. 
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of the formalism presented above. The goal of the work was the specification 
of a railway network system. The static verification of coordination rules with 
respect to grammars was deemed the most attractive feature of the formalism 
[37]. 

These applications provide evidence that (Structured) Gamma is an intuitive 
and formally based formalism to describe and analyze software architectures. 

4.4 Other Works 

Influences of the chemical reaction model can be found in other domains such 
as visual languages [36], protocols for shared virtual memories [49] or logic pro- 
gramming [19,61]. We just sketch here works around coherence protocols and 
logic programming where Gamma played a key role. 

Formalization of coherence protocols. Goherence protocols for shared vir- 
tual memories have been formalized as Gamma programs in [49]. The multiset, 
whose elements are the protocol entities or events, represents a global view of 
the system. The protocol itself is described as a Gamma program (i.e. a collec- 
tion of reaction rules). Besides, a fragment of first-order logic is used to specify 
properties that the protocol is expected to satisfy. This formalization made it 
possible to design an algorithm checking that properties are indeed invariants of 
the protocol. [49] presents a Gamma formalization of the Li and Hudak protocol 
[44] as well as the automatic verification of a collection of invariants. 

This approach has been applied to software architectures in [53] . The dynamic 
evolution of the architecture is described as a Gamma program (as in section 4.3). 
Instead of checking membership to a given style (as in section 4.3), the algorithm 
of [49] is reused to check that the evolutions of the architecture respect some 
logical properties specified in a separate language. 

Gamma and logic programming. Several proposals have been made for in- 
tegrating Gamma and logic programming languages. A first approach, followed 
in [19], uses multisets of terms and describes conditions and actions as pred- 
icates. This model is implemented as an extension of Godel, a strongly typed 
logic programming language with a rich module system. It involves a definition 
of multiset unification and a careful integration with the operational semantics 
of Gamma, in which the “choices” made by reaction conditions are not back- 
trackable. This extension, called Gammalog, includes the sequential and parallel 
composition operators introduced in [34]. Another approach is followed in [61] 
where the objects in the multisets are goal formulas and the (condition, action) 
pairs are goal-directed deduction rules. This results in ALO, an extension of LO 
[3] which can itself be seen as an elaboration on the basic chemical reaction 
model. ALO can be seen as a higher-order extension of LO in the same way 
as AProlog is a higher-order extension of Prolog. Implication in goals provides 
the ability to construct (or augment) the program at run-time and the use of 
multisets leads to a uniform treatment of programs and data. 
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5 Conclusion 

A number of languages and formalisms bearing similarities with the chemical 
reaction paradigm have been proposed in the literature. Let us briefly review 
the most significant ones: 

— A programming notation called associons is introduced in [54]. Essentially 
an associon is a tuple of names defining a relation between entities. The 
state can be changed by the creation of new associons representing new 
relations derived from the existing ones. In contrast with Gamma, the model 
is deterministic and does not satisfy the locality properties (dues to the 
presence of V properties) . 

— A Unity program [14] is basically a set of multiple-assignment statements. 
Program execution consists in selecting non deterministically (but following 
a fairness condition) some assignment statement, executing it and repeating 
forever. [14] defines a temporal logic for the language and the associated proof 
system is used for the systematic development of parallel programs. Some 
Unity programs look very much like Gamma programs (an example is the 
exchange sort program presented in the introduction) . The main departures 
from Gamma is the use of the array as the basic data structure and the ab- 
sence of locality property. On the other hand. Unity allows the programmer 
to distinguish between synchronous and asynchronous computations which 
makes it more suitable as an effective programming languages for parallel 
machines. In the same vein as Unity, the action systems presented in [5] are 
do-od programs consisting of a collection of guarded atomic actions, which 
are executed nondeterministically so long as some guard remains true. 

— Linda [32,15] contains a few simple commands operating on a tuple space. 
A producer can add a value to the tuple space; a consumer can read (de- 
structively or not) a value from the tuple space. Linda is a very elegant 
communication model which can easily be incorporated into existing pro- 
gramming languages. 

— LO (for Linear Objects) was originally proposed as an integration of logic 
programming and object-oriented programming [3]. It can be seen as an 
extension of Prolog with formulae having multiple heads. From an object- 
oriented point of view, such formulae are used to implement methods. A 
method can be selected if its head matches the goal corresponding to the 
object in its current state. The head of a formula can also be seen as the 
set of resources consumed by the application of the method (and the tail 
is the set of resources produced by the method). In [4], LO is used as a 
foundation for interaction abstract machines, extending the chemical reac- 
tion metaphor with a notion of broadcast communication: sub-solutions (or 
“agents” ) can be created dynamically and reactions can have the extra effect 
of broadcasting a value to all the agents. 

Taking a dual perspective, it is interesting to note that the physical modeling 
community has borrowed concepts from computer science leading to formalisms 
which bear similarities with higher-order Gamma. An example of this trend of 
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activity is the “Turing gas” [29] where molecules float at random in a solution, 
reacting if they come into contact with each other. A language akin to lambda- 
calculus is used to express the symbolic computation involved in the reactions. 

As a conclusion, we hope that this paper has shown that the chemical reac- 
tion model is a particularly simple and fruitful paradigm. No doubt that new 
surprising developments are yet to come. 
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Abstract. Diagrammatic notations, such as Venn diagrams, Petri-Nets 
and finite state automata, are in common use in mathematics and com- 
puter science. While the semantic domain of such systems is usually well 
formalized, the visual notation itself seldom is, so that they cannot be 
used as valid devices of formal reasoning. A complete formalization of 
such notations requires the construction of diagram systems with rigor- 
ously defined syntax and semantics. We discuss how diagram specifica- 
tion can be interpreted as multiset rewriting and, based on this, how it 
can be formalized in linear logic. We discuss the power of our approach 
through an illustration of its possible extension with reflective capabil- 
ities to manage negative conditions, and through the identification of a 
class of diagrammatic transformations which can be directly expressed 
in our framework. 



1 Introduction 

A common way of modelling the world is as a collection of entities and rela- 
tions. Such a paradigm, though adequate to build static definitions of a domain, 
presents difficulties in modelling dynamics. In particular, the frame problem 
arises when modelling syntactic transformation of diagrams. 

Syntactic diagram transformations constitute the fundamental tool for per- 
forming resoning with diagrammatic representations and visual simulation of 
processes or systems. The main problem is that, graphical sentences being two- 
dimensional, they cannot be constrained to support only a privileged relation, 
such as the prefix relation of one dimensional representations, e.g. lists, strings, 
etc. Nor can they be restricted to support only partial orders, such as the 
ancestor-descendent relation in a tree. Therefore, the direct extension to the 
two dimensional world of images of techniques developed for one dimensional al- 
gebraic structures falls short in describing dynamic aspects of visual sentences. 

Graphs have been proposed as a data type adequately supporting the rep- 
resentation of entities (nodes) and relations (edges), and able to express the 
abstract syntax and semantics of diagrams [BMST99]. Several approaches to 
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graph transformation have been developed. However, these approaches are not 
free from problems, as will be discussed later, and are not adequate to model 
the insurgence of new relations as a consequence of the modification of a part of 
a diagram. 

What is needed is a formalism which does not commit to any specific inter- 
pretation or reading convention of images, i.e. that does not fix the set of spatial 
relations that are considered once and for all, and that supports a uniform way 
for the management of different spatial relations between significant structures 
in an image. We argue that multisets are an adequate formalism in which to 
set the management of visual entities and that linear logic provides an adequate 
calculus for diagram transformation. 

In particular, we demonstrate that a small fragment of linear logic is ade- 
quate for expressing the parsing of diagrammatic languages, through which we 
can build the meaning of a visual sentence. We argue that this same fragment 
characterises a class of diagram transformations encompassing several types of 
executable diagrammatic specifications employed in Software Engineering, such 
as Petri Nets or Transition Systems, as well as typical transformations employed 
in diagrammatic reasoning. 

The identification of such a fragment provides an advantage in that it allows 
the expression in a uniform way of several calculi developed ad hoc for the 
different types of visual language. 



2 Representing Diagrams as Mnltisets 

The first step to formalizing diagram transformation systems is to find a proper 
representation for static diagrams. Diagrams can be considered as collections of 
graphical tokens such as lines, circles, etc. A fundamental distinction between 
diagrammatic languages and string languages is that no single linear order is 
inherently defined on diagram elements. Hence, diagrammatic expressions must 
be considered as non-linear structures. Typical algebraic non-linear structures 
used to support diagram management are graphs and multisets, rather than 
sequences of tokens. Diagram transformation can then be interpreted as rewrit- 
ing of such structures. In this paper we describe a multiset-based approach, 
i.e. we view the tokens as resources that can be produced, consumed, queried 
and changed. Linear logic presents itself naturally as a framework for studying 
diagram transformations based upon such a model. 

The objects in a multiset that represents a diagram must contain a represen- 
tation of the diagram geometry. Moreover, a proper typing mechanism is also 
desirable. For example, on the most elementary level, the square of Figure 1 
could be represented as the multiset {/me(140, 5), ^me(140, 85), line{220, 85), 
line{220, 5)}. However, we might want to model objects at a higher level of 
abstraction, so that the entire square in the figure is represented as a single ob- 
ject {s(7uare(140, 5, 80)}. The construction of this new type from the primitives 
could be defined by a simple rule and parsing could be used to construct the 
higher-level presentation. 
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For the purpose of this article, we adopt a view in which meaningful types of 
graphical entities are given by specific graphical data types. We regard their con- 
struction as encapsulated in abstract constructors so that the process by which 
they can be recognized is hidden. From a technical perspective, the recognition 
of these data types from a diagram can be done in various forms, for example 
by transformation of prototypes [BCM99], by parsing [MMW98], with algebraic 
specifications [BMST99] or using description logic [Haa98] . 

A strong requirement to be imposed on the graphical data type system is 
that type assignment must be unambiguous. For example, in a system where 
both a rectangle and a square data type exist, rectangles with equal sides must 
consistently be considered as squares. Inheritance can be used to model the 
relation between these types [Haa98]. 

An important requirement for the transforma- 
tion mechanism is implied: A transformation which 
modifies the geometry of an element so that it is 
incompatible with its original data type must also 
change the type of this element accordingly. In the Fig. 1. A simple diagram, 
rest of this paper we assume that these conditions 
are met by the type systems and the transforma- 
tion rules adopted in the examples. For discussions 
of the limits of these assumptions see [BCM99,BMST99,MMW98]. 

The second question to be addressed is how to represent the spatial rela- 
tions between the diagrammatic objects. Several alternatives for this have been 
considered in the literature: 

Explicit representation of relations: The relations existing between to- 
kens are reified and inserted in the multiset as tokens in themselves. In this 
case, the diagram of Figure 1 could be represented as {letteri{ “a”, 35, 45, 10, 
10), sgMarei(0, 0, 80), czrc^ei(40, 40, 20), inside{letteri, circlei), inside{circlei, 
squarei)}. 

Use of constraints: The multiset M is combined with a set of constraints C 
that describes the relevant spatial relations expressed as (usually arithmetic) 
constraints on attributes of the objects. The pair (M, C) representing the dia- 
gram of Figure 1 could be described by {letter {^^d\xi),circle{x 2 ),square{x 3 )} 
together with the constraints {inside{xi,X 2 ) A inside{x 2 , a^a)}. Since attributes 
are used to represent the geometries, spatial relations can be queried by inspec- 
tion of this geometry, e.g. by testing constraint entailment or consistency. These 
functions are realized by a constraint solver for the underlying domain. 

Use of concrete geometries: For this representation, tokens with geometric 
attributes are used as in the case of the constraint representation. However, 
attributes are always given concrete values. In this case, the representation of 
Figure 1 would be {letter{“a” , 35, 45, 10, 10), square{0, 0, 80), circle{A0, 40, 
20 )}. 

The choice of representation has some important implications related to the 
frame problem. Consider the transformation rule depicted in Figure 2 which ex- 
presses the removal of a circle object. It corresponds to a multiset transformation 
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rule which could informally be written as: 

circlei{-, _), square\{X , Y, Z) , inside{circlei , squarei) square\{X, Y, Z) 

Applying this rule to Figure 1 we are left with the problem of what to do with 
the inside relation between the letter and the circle. Since the circle no longer 
exists, it should be removed, but now an inside relation should hold between the 
letter and the square. This causes problems with the first two representations. 

The fundamental difference in the way these representations handle the prob- 
lem is this: In the first representation, the inside relation is represented explicitly 
by an uninterpreted, symbolic relation. Any transformation must therefore ex- 
plicitly handle such relations and relevant change in the spatial relations must be 
propagated explicitly. Essentially, one needs to re-implement the relevant frag- 
ment of geometry, which makes the specification cumbersome and error-prone. 

In the constraint-based representa- 
tion, the inside relation is managed by 

mapping the geometry to the underly- O ^ 

ing arithmetic domain and by having 

a constraint solver handle the arith- Fig. 2. Simple Transformation, 

metic constraint theory. The propaga- 
tion of spatial constraints happens for 

free, because the corresponding arithmetic constraints are propagated automati- 
cally by this solver. However, in the context of transformation we are now facing 
a new problem, which stems from the fact that an appropriate constraint solver 
must work incrementally. The full set of constraints is only evaluated once for the 
initial diagram. After this, the constraint store should be adapted incrementally 
during the transformation, i.e. only those spatial constraints that change should 
be updated, added or deleted. The problem arises, because constraints in the 
store have to be kept in a solved form, so that it is difficult to perform a mean- 
ingful constraint deletion. Even if a single constraint can explicitly be removed, 
it is difficult to remove all implied constraints. As a simple example, consider 
the existence of three attributes a, b, c. When the constraints a = b A b = c 
are asserted, a = c is automatically derived. Even if we remove b = c from 
the constraint store, a = c would still be in the store, but it would no longer 
be justified. Transformation of constraint diagram representations therefore re- 
quires a re-evaluation of the existing spatial relations, which is inefficient. These 
problems have been explored in [Mey97]. 

These kinds of problem make it difficult to provide a linear logic character- 
ization of diagram transformation based on either of these representations. For 
this reason, we adopt the representation based on concrete geometry here, and 
we assume the existence of a specialized geometry agent which is able to answer 
queries regarding spatial relations in a given diagram by performing arithmetic 
calculations based on its concrete geometry. 
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3 Diagram Transformation as Mnltiset Rewriting 

We now move from discussing the representation of diagrams as multisets to 
investigating diagrammatic transformations through multiset transformations. 
In general, such diagram transformations detail how one diagram can syntacti- 
cally be transformed into another diagram that directly and explicitly exhibits 
additional information that is either not present or only implicit and hidden in 
the prior diagram. This transformation can be understood as multiset trans- 
formation or rewriting. This is not a completely new idea, since whereas the 
specification of diagram transformation has often been based on graph gram- 
mars [BMST99], other approaches have also used type-0 variants of attributed 
multiset grammars [MMW98,MM00]. 

Even though different formal theories for multi-dimensional grammars have 
been developed, there is no grammar calculus that would allow us to deduce 
soundness or completeness results for diagrammatic calculi on the basis of their 
grammar specifications. It is for these reasons that logic has been explored as an 
alternative tool for diagram rewriting. The first obvious choice to explore is clas- 
sical first-order logic. Roughly speaking, two different embeddings are possible: 
Either the objects of the diagram and their relations are represented as predicates 
or, alternatively, they can be modelled as term structures. For diagram rewrit- 
ing, the first type of embedding has been demonstrated in [HM91,Mar94,Mey00]. 
The second approach is closely related to modelling string language grammars 
in logic programming by Definite Clause Grammars and to their extension 
in the form of Definite Clause Set Grammars [Tan91]. Variations for the use 
of diagram parsing and/or rewriting have been demonstrated and discussed 
in [HM091,Mey97,MMW98]. 

Both embeddings of diagram rewriting into first-order logic have drawbacks 
that make their universal utility questionable. In the first embedding (graphical 
entities as predicates) a typical rewrite step needs to add as well as to remove 
objects, which amounts to deriving new predicates (conclusions) and deleting 
old conclusions. This is not possible in classical first order logic due to its mono- 
tonicity. In the second embedding there are no restrictions on how the diagram 
could be rewritten, but the modelling does not leverage from the structure of 
the underlying logic anymore, since, essentially, this is “abused” as a rule-based 
rewrite mechanism. In contrast, what is really desirable is that the logical deriva- 
tion relation can directly deal with terms representing the graphical elements, 
so that a direct correspondence between logical derivations and those in the 
diagram system exists. 

A detailed analysis of these problems has recently been given for the first 
time in [MMOO], where the use of linear logic [Gir87,Gir91] is advocated instead. 
The major advantage of linear logic over classical logic in the context of diagram 
rewriting is that it is a logic of resources and actions, and therefore adequately 
models a non-monoton ous rewrite process. Linear implication models the fact 
that the left-hand side resources (the antecedent) are used and consumed (sic) 
in the process of producing the right-hand side resources (the consequent). This 
is exactly the process of multiset rewriting that we have to model. 
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The question arises which fragment of linear logic we should use. One needs to 
use multiplicative connectives to express the simultaneous existence of elements 
in the multiset of diagram elements, while linear implication is adequate to 
express the rewriting as such. Since all the elements in a multiset of objects 
representing a diagram have to exist simultaneously, a natural choice is to use 
multiplicative conjunction (®) to model their union in a multiset. Therefore, we 
might decide to express, for instance, the transformation rule in Figure 2 as 

circle{xi) ® square{x 2 ) ® inside{x ^ , x 2 ) square{x 2 ) 

This choice, advocated in [MMOO], seems natural and conceptually correct. How- 
ever, we are not interested in modelling diagram rewriting in linear logic for its 
theoretical properties alone, but we are also interested in linear logic as a well- 
founded theoretical framework for declarative computational implementations 
of diagram transformation. Therefore, we have to pay attention to whether the 
chosen fragment is adequate as the basis of implementations. 



3.1 LO and Interaction Abstract Machines 

The basic idea for an implementation of our framework is that it should be 
directly transformable into a linear logic program. Unfortunately, current lin- 
ear logic programming languages [Mil95] do not offer multiplicative conjunc- 
tion in the rule head. It would therefore be advantageous to find a differ- 
ent fragment of linear logic that directly corresponds to a linear logic pro- 
gramming language. The fragment we introduce in this paper is a slight 
extension of the one used in the linear logic programming language LO. 
A benefit of LO is that it has an 
interpretation as a system of in- 
teracting collaborating systems 
of agents, called Interaction Ab- 
stract Machine (lAM) . This can 
later be used as the basis of 
integrating interaction specifi- 
cations with our approach. We 
will now give a very brief intro- 
duction to the lAM and its in- 
terpretation in LO. For a full in- 
troduction the interested reader 
is referred to [ACP93,AP91]. 

The lAM is a model of in- 
teracting agents in which an agent’s state is fully described as a multiset of 
resources. The agents’ behavior is described by a set of rules called “methods” . 
A method takes the form A\ 'S’ • • • ’S’ A„o— meaning that a multiset con- 
taining the elements Ai, ■ ■ ■ , A„ (the head) can be transformed by eliminating 
these elements and producing effects as specified by the form of B (the body). 
Each resource Ai and each resource in B is syntactically given as a first-order 




Fig. 3. lAM Evolution 
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term f{Xi, . . . ,X„) like in standard logic programming. A method can fire if 
the appropriate head elements are found among the agent’s resources. 

If more than one rule can fire at a given time, e.g. if the agent’s resource set is 
{a,p, q} and its rules are p ’S’ ac^r and q ’S’ ao— s, these rules compete, i.e. one of 
them will be chosen, committed and applied. If the body could only contain the 
connective ’S’, the rules would only allow us to describe the behavior of a single 
agent, but lAMs also use a second kind of connective (in the body of a rule) 
which effectively spawns another agent. All the resources of the spawning agent 
not consumed by the rule application are copied into each spawned agent. This 
connective is the additive conjunction read “with”. & takes priority on ’S’. A 
final lAM operation is to terminate an agent. This is denoted by a T in the body 
of a method. This behavior is illustrated in Figure 3 (redrawn from [ACP93]), 
depicting a possible development of an agent with state {p, a, u} whose behavior 
is specified by the rules p^a^{q^r)&s and r ’S’ u o— T. 

The complete behavior of a method in a multiagent lAM is summarized by 
the following rules: (1) A rule is applicable to an agent only if all the rule head 
atoms occur in the agent’s state. (2) If a rule is applicable and is selected for 
application to an agent, the rule’s head atoms are first removed from the agent’s 
state; (3) The new configuration of the lAM is defined according to the form of 
the body: (a) If the body is the symbol T, the agent is terminated; (b) If the 
body is the symbol T, no atom is added to the state of the agent; (c) If the 
body does not contain any &, then the body elements are added to the agent 
state; (d) If the body consists of several conjuncts connected by &, then for each 
occurrence of cS< a new agent containing a copy of the original agent’s resources 
is spawned. For all the resulting agents (including the original one) the atoms in 
the corresponding conjunct are added to the agent’s state. 

The lAM, even when restricted to the single agent case, defines a form of 
multiset rewriting that is applicable to the rewriting of diagrams as outlined 
above. 

The advantage gained from using the lAM as our basic model is that its 
interpretation can be given in a small fragment of linear logic which only consists 
of the connectives par (’S’), with {&) and linear implication (o— ), as implemented 
by the linear logic programming language LO [AP91]. The most important idea of 
this interpretation is that a terminating evolution of an lAM directly corresponds 
to a successful proof according in linear logic. ^ 



3.2 Diagram Parsing as Linear Logic Programming 

Parsing can be considered as the most basic task in diagram transformation. 
First, it seems of fundamental importance to be able to analyze the correctness 
of a diagram and to interpret its structure. Secondly, context-free parsing ac- 
cording to a multiset grammar corresponds to a particularly elementary form 

^ lAMs and LO also use another kind of connective that emulates broadcasting among 
different agents. We do not need this connective for our tasks and will therefore 
exclude it. 
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of diagram transformation in which each transformation step replaces a multi- 
set of diagram objects with some non-terminal object. We will therefore first 
look at diagram parsing in linear logic, before proceeding to arbitrary diagram 
transformation. Diagram parsing has been studied by a number of different re- 
searchers before. The interested reader is referred to [MMW98,BMST99] for 
comprehensive surveys and to [MMOO] for diagram parsing in linear logic. Here 
we review a particular type of attributed multiset grammars, termed constraint 
multiset grammars (CMGs), which have been used by a number of researchers 
for reasonably complex tasks such as the interpretation of state diagrams and 
mathematical equations. In [Mar94] a precise formal treatment is given, and 
we review only the basic notions here. CMG productions rewrite multisets of 
attributed tokens and have the form 

U ::= U\, ...,[/„ where (C) {E} (1) 

indicating that the non-terminal symbol U can be recognized from the sym- 
bols C/i , . . . , Un whenever the attributes oiUi, . . . , C/„ satisfy the constraints 
C. The attributes of U are computed using the assignment expression E. The 
constraints enable information about spatial layout and relationships to be nat- 
urally encoded in the grammar. The terms terminal and non-terminal are used 
analogously to the case in string languages. The only difference lies in the fact 
that terminal types in GMGs refer to graphic primitives, such as line and circle, 
instead of textual tokens and each of these symbol types has a set of one or more 
attributes, typically used to describe its geometric properties. A symbol is an in- 
stance of a symbol type. In each grammar, there is a distinguished non-terminal 
symbol type called the start type. 

GMGs also include context-sensitive productions. Gontext symbols, i.e. sym- 
bols that are not consumed when a production is applied, are existentially quan- 
tified in a production. As an example, the following context-sensitive production 
from a grammar for state transition diagrams recognizes transitions: 

T:transition A:arc exists Sl:state,S2:state where ( 

OnCircle(A. start, Sl.mid, SI. radius) and OnCircle(A. end, S2.mid,S2. radius)) 

{T. start = A. start and T.tran = A. label and T.end = A. end } 

A diagrammatic sentence to be parsed by a GMG is just an attributed multi- 
set of graphical tokens. Therefore we can view a sentential form as the resources 
of an I AM agent. Intuitively, it is clear that the application of a GMG produc- 
tion corresponds closely to the firing of lAM methods and that a successful parse 
consists of rewriting the original set of terminal symbols into a multiset that only 
contains the single non-terminal symbol which is the start symbol. 

We can map a GMG production to an LO rule (lAM method, respectively) 
and hence to a linear logic implication in the following way: For a GMG produc- 
tion 

U ::= U\, . . . ,Un exists Un+i, ■ ■ ■ , Um where (C) {E} (2) 

we use the equivalent LO method: 

t{ui) T{Ujn)^{C} & {E} & t{u) ^ T{Un+l) T{Um) ( 3 ) 
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In LO, each CMG terminal and non-terminal object Ui will be represented by a 
first order term r(ui) which has the token type of Ui as the main functor and 
contains the attributes in some fixed order. We extend this mapping function 
canonically so that we use r(ui, . . . ,u„) to denote t{ui) ^ ^ T{un). In the 

same way, r(p), for a CMG production p, will denote its mapping to an LO rule 
and t{G) = {t{p) \ p G P} denotes the complete mapping of a CMG G to an 
LO program. 

The linear logic reading of such a method t(p) is its exponential universal 
closure: 

! V r('Ui) T{Um)o-{G} & {E} & t{u) ^ T{Um) (4) 

To evaluate the constraints in the grammar and to compute the attribute 
assignments we assume that the geometric (arithmetic) theory is available as a 
first-order theory Pg in linear logic. Obviously, geometry cannot be completely 
axiomatized in LO, because its fragment of linear logic is too small. However, we 
can encapsulate a more powerful “geometry machine” (and arithmetic evaluator) 
in a separate agent and give evaluation requests to this agent. This is what we 
do by using “with” ( &) to spawn agents for these computations in the above LO 
translation. 

From an operational point of view, this requires us to adopt a proactive inter- 
pretation of LO in which we can spawn an agent and wait for it to return a result 
before proceeding with the rest of the computation. A different implementation 
of a proactive LO interpretation, by sending requests from a coordinator to reg- 
istered participants, is provided by the Coordination Language Facility [AFP96]. 

Each rule r(p) emulates exactly one production p. To emulate parsing fully, 
we also need a rule which declares that a parse is successful if and only if the 
initial diagram is reduced to the start symbol and no other symbols are left. For 
a CMG G with start symbol s, we could do this in linear logic by adding t(s) 
as an axiom to t(G). Unfortunately, from an implementation point of view, we 
cannot formulate true linear axioms in LO. It is more consistent with the LO 
model to extend the language with the linear goal 1, which terminates an agent 
if and only if this agent does not have any resources left (i.e. 1 succeeds if and 
only if the linear proof context is empty). We will call this extension of the LO 
language LOi. Instead of the axiom t(s) we can then add the method r(s)c^l 
to the LOi program. The complete set of LO\ rules that implement a grammar 
G is: 77 = r(G) U {(t(s)o— 1)} 

Operationally, a successful parse of a diagram D now corresponds to an lAM 
evolution with method set 77 starting from a single agent with the resource 
set t{D) in which all agents eventually terminate. Logically, it corresponds to 
a proof of Pg,n h t{D). This linear logic embedding of CMGs is sound and 
complete. 

Theorem 1. 77 G £(G) Eg, 77 h r(77) 

The proof is given in [BMMPOl]. 

LOi is only a minor extension of LO and a proper subset of the linear logic 
programming language Lygon [HPW96]. Thus we still have an executable logic 
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as the basis of our model. In fact, it is Lygon’s sequent calculus [HP94] that we 
will use in the remainder of this article. In contrast to LO which applies rules 
by committed choice, Lygon actually performs a search for a proper proof tree. 
Therefore, if there is a proof for Tg, U h t{D), i.e. if I? € C{G), Lygon will find 
this proof. This is in contrast to LO, which, even disregarding the extension with 
1, can only be guaranteed to find the proof if G is confluent. 

3.3 Negative Application Conditions 

We have shown that context-sensitive rules are expressed in the adopted frag- 
ment of linear logic by simply using the contextual resources in the head and in 
the body of a rule. It is interesting to investigate how negative context condi- 
tions, stating that something must not exist in the diagram, can be expressed. 
Though negative application contexts are not adopted in all grammar-based ap- 
proaches to diagram parsing, there are many visual languages that can only be 
defined with negative conditions. Therefore, some grammar frameworks, such as 
Constraint Multiset Grammars [MMW98] and several types of graph grammars, 
have adopted negative contexts. In particular, in the Single Push-Out Approach 
(SPO) [EHK+97], negative application conditions (NAG) are expressed as an 
addition to the left-hand side (LHS) of a rule, so that for a rule to be applied, 
the mapping m from the LHS to the host graph G must not be extendable with 
a mapping n from the NAC to G. 

Since negation in linear logic is constructive, it is not possible to directly 
state such a negative condition in a form such as a-*- 'S’ b(^c to express that b is 
transformed into c if a is not present. 

Given the importance of negative application conditions for diagram specifi- 
cation and transformation we allow ourselves some speculation how an extension 
of the framework presented above could allow us to handle negative contexts. 

In linear logic the notation o'*", in fact, does not represent the absence of a 
in the multiset, which is what we really want to express. On the contrary, o"*" is 
used to represent a dual resource with respect to a. Intuitively, we can say that 
a-*~ can be seen as the answer to a request of type a, so that they can disappear 
together in a context where a has been requested and o'*' has been produced. In 
the linear logic programming language Lygon [HP94] indeed, negated resources 
are used to represent facts, while positive resources in the right-hand side of a 
rule are goals to be satisfied. 

Let us formulate negative application conditions in the form bo—c&neg{a), to 
indicate that we do not want a to be present or, in other words, that we want the 
proof not to terminate successfully in the branch created by the cS< if a is present. 
We might want to achieve this by using a rule of the form X ^ neg{X)o—^, 
where ^ is a resource which is not used anywhere else and therefore cannot be 
removed. However, this is not valid, since we can not guarantee that this rule 
is really applied. In fact, there is no guarantee that X will not be consumed by 
some other rule or that no rule of type c 'S’ neg{a)o—d could undo the effect of 
producing the negated resource. Moreover, this approach does not allow us to 
obtain a guard on the application of the rule. 
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We have to look for an approach that allows us to model the committed 
choice application of a rule based on negated guards, i.e. which ensures that the 
element was not present at the time of its usage. The problem is that negative 
application conditions essentially express conditions on the global configuration 
of the multiset. Hence, to be sure that a rule is applicable we need to maintain 
a global representation of the state and check it in the rule itself. This can 
be achieved if we keep book of all the resources we produce and consume so 
that the check for non-existence is reduced to asessing membership in a given 
set in the resource records. Since the condition may involve checking values of 
the attributes of a resource, we use resources which include a representation of 
the actual resource. This approach obviously amounts to introducing a limited 
form of metalevel resources in the multiset and its augmentation with reflective 
capabilities. Such an extension has been discussed in [BBMP97]. 

It would also be possible to use a representation purely based on the counting 
of resources to achieve the same effect, but this would lead to a much more 
complex rule set. 

To achieve this, we could instantiate in the initial multiset a collection of 
resources of type recX{SetOfX) for each type X of atoms in the alphabet of the 
grammar. The idea is that SetOfX is simply a (multi)set of all resources of the 
type X that are currently present. It is easy (though complex) to re-write a rule 
set to use these reflective resources. In the above approach a rule of the form 
a{X)o—f(X,Y) & b{Y) where f{X,Y) is some function on the attributes of X 
and Y would then be transformed into the form: 

recA(SetOfA) >S> recB(SetOfB) ^ a(X) 

O- f(X,Y) & diff(SetOfA, a(X), NSA) & 

diff(NSB, b(Y), SetOfB) <& h(Y) ^ recA(NSA) ^ recB(NSB) 

A proof for the operator dijj succeeds if and only if the third argument is 
the multiset obtained by removing the resource denoted by the second argument 
from the multiset in the first argument. In our treatment, we assume that this is 
available through a theory Fg which contains geometric, arithmetic and elemen- 
tary set theory facts as axioms. Tests on the attributes of C are admissible as 
long as their negation can be formalized in Fg. For all practical implementation 
purposes dijf can also be implemented via the host language. 

Now negative conditions can be checked as non-membership in the suitable 
record of a resource with given properties. A rule of the form a(Al) 'S’ neg(c{Z)) 
o— fi{X, Y) & f 2 {X, Z) & b{Y) involving a negative condition on the presence of 
c{Z) will then be written as: 

recA(SetOfA) >S> recB (SetOfB) ^ recC(SetOfC) >S> a(X) 

O- fi{X,Y) &f2{X,Z)& diff(SetOfC,c(Z),SetOfC) & 
diff(SetOfA, a(X), NSA) & diff(NSB, b(Y), SetOfB) & 
b(Y) >S> reeA(NSA) ^ reeB(NSB) ^ recC(SetOfC) 

The price to pay in this approach is obviously the management of reflective 
resources and a much more complex rule-set, which, however, could be generated 
automatically. It is also obvious that this approach can only handle a limited 
type of negation as failure. 
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4 Applications 

In the previous sections we have presented a general approach to diagram trans- 
formation based on linear logic. In this section we present two simple applications 
of this approach: One in which the diagram transformation corresponds to the 
execution of a computation and one in which diagram transformations are used 
to reason in some underlying domain represented by the diagram. 

4.1 Executable Diagrams 

By executable diagrams we refer to diagram no- 
tations that are used to specify the configurations 
of some system. Transformation of such diagrams 
can be used to simulate and animate the transfor- 
mation of these configurations. Typical examples of 
such systems are Petri nets, finite state machines 
and a number of domain specific visual languages. 

For these systems, a multiset representation is intuitively apt and the transfor- 
mation rules for the multiset closely correspond to the diagram transformation 
rules. As an example, consider a transition in a finite state diagram, such as the 
one in Figure 4. 

Let us adopt a set of data types corresponding to the definition of states, 
with a geometry, a name and a couple of attributes denoting whether the state 
is initial and/or final; transitions, defined by their geometry and an input symbol; 
and input labels, which are positioned under the current state, to be read one 
symbol at a time. 

A straightforward translation of the partial diagram of Figure 4 results in 
a rule which corresponds exactly to the semantics of the depicted transition, 
assuming that in a diagrammatic animation of the transformation the input 
string is always placed under the current state: 

state{{W9,24),sl,nonfinal,noninitial) 'S’ tronsition((129,24),(189,24),“a”) 'S’ 
state{{209, 24), s2, nonfinal, noninitial) 'S’ input {{109, 40), [“ a” \ Rest]) 
ly- state{{109,24),sl,nonfinal,noninitial) 'S’ transition{{129,24),{189,24),“a”) 'S’ 
state{{209,24),s2,nonfinal,noninitial) 'S’ input{{209 ,40) , Rest) 

The whole diagram is translated into such a set of rules, one for each transition, 
and its execution can be started by placing the input string under the initial 
state. 

We can, however, have a more general view of this process and define the 
behavior of such animations independently of any concrete diagram: 

state{Geoml,Namel,Fl,Il) 'S’ state{Geom2,Name2,F2,I2) 'S’ 
transition{GeomZ, Lab) 'S’ input{Geom4,[Lab\Resi\) 
o— starts At{Geom3, Geoml) & endsAt{Geom3, Geom2) & 
below{Geom4,Geoml) & below{Geom5,Geom2) & 
state{Geoml, Namel, FI, II) ^ state{Geom2,Name2,F2,I2) 'S’ 
transition{Geom2>, Lab) 'S’ input{Geomb, Rest) 




Fig. 4. FSA Transition 
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where startsAt, endsAt and below are suitable predicates that check the corre- 
sponding spatial relations, possibly instantiating the geom attribute appropri- 
ately. 

In the LOi setting, a rule for expressing acceptance of the input would be: 
stateiGeoml, final, -) 'S’ input(Geom2,\\) o— T &below{Geom2,Geoml). 

Note that the termination of an agent indicates the success of a branch in the 
corresponding proof. 

It is easy to see how the two alternative approaches can both provide an op- 
erational semantics for executable diagrams. In both cases, the actual execution 
of the transformations occurs uniformly according to the LO\ proof system. 

4.2 Diagrammatic Reasoning 

Often we are using diagram transformations not so much to define the config- 
uration of a computational system, but to reason about some abstract domain. 
A typical case of the use of diagrams to perform such reasoning are Venn Dia- 
grams. A variant of these, developed by Shin [Shi95], provides a formal syntax, 
semantics and a sound and complete system of visual inference rules. 

In these diagrams, sets are represented by regions, shaded regions imply that 
the corresponding set is empty, and a chain of X implies that at least one of the 
regions marked by an X in the chain must be non-empty. 

As an example. Figure 5 says that A and B are 
not both empty (expressed by the chain of X) and 
nothing is both in A and in B (expressed by shad- 
ing). By inference we can obtain that the elements 
in B and in A must belong to the symmetric dif- 
ference A /S, B = A - B U B - A. This diagram pig. 5 . a Venn diagram 
is equivalent to one in which the X in the shaded 
region is removed. Such an equivalence is expressed 

by the “Erasure of Links” inference rule. This can be stated as “an AT in a shaded 
region may be removed from an AT-chain provided the chain is kept connected.” 
We reformulate this textual rule as a set of linear logic rules, defined on the 
following graphical data types: (1) chain, associated with an attribute setOfX 
which stores the locations of the X elements in the chain; (2) x, with an attribute 
pt giving its position and an attribute num, giving the number of lines attached 
to it; (3) line, with an attribute ends giving the positions of its two ends; (4) re- 
gion, with an attribute geom, allowing the reconstruction of the geometry of the 
region, and an attribute shading, indicating whether the region is shaded or not. 
We assume that these elements are correctly and consistently identified in the 
diagram. Additionally, some synchronization resources are used to indicate that 
the transformation is performing some not yet completed process. Link erasure 
is defined by the following actions: 

(1) A point inside a shaded region is eliminated and the set in the chain is 
updated accordingly. A synchronization resource is used to ensure that all the 
elements previously connected to it will be considered: 
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chain(SofX) 'S’ x{Pt, Num) 'S’ region{Geom, shaded) 
o— chain{G, Pt, Num) 'S’ region{Geom, shaded) & 
inside{Pt, Geom) & diff{SofX, Pt, G) 

(2) Points previously connected to the removed element are marked and the 
connecting lines are removed: 

chain{G, Pt, Num) 'S’ line{Ends) 'S’ x{Ptl,Numl) 

o— equal{Ends, {Pt, Ptl}) & minus{Numl,l, Numll) & 
chainiG, Pt, Num) 'S’ x{Ptl,cand, Numll) 

(3) If the removed point was inside the chain, its two neighbors are connected 
by a new line. Synchronization resources are removed and a consistent state is 
restored: 

chain{G, Pt,2) 'S’ x{Ptl,cand, Numl) 'S’ x{Pt2,cand, Num2) 
ly- equal{Ends, [Ptl, Pt2}) & plus{Numl,l, Numll) & 
plus{Num2,l,Num21) & chain{G) 'S’ x{Ptl, Numll) 'S’ 
x{Pt2,Num21) 'S’ line{Ends) 



(4) If the removed point was at an end of the chain, its neighbor is now at an 
end or it is isolated. 

chain{G, Pt,l) 'S’ x{Ptl,cand,l) o— chainiG) 'S’ xiPtl,l) 
chain(G, Pt,l) 'S’ x{Ptl,cand,0) o— x{Ptl,0) 

(5) If the removed point was an isolated element, the diagram was inconsistent, 
and the chain is removed altogether: 

chain{G, Pt, 0) _L 

The erasure process goes through intermediate steps in which the diagram is not 
a Venn diagram (for instance, dangling edges appear in the chain). Such inconsis- 
tent diagrams correspond to states in which synchronization resources appear in 
the multiset. The process is, however, guaranteed to terminate with a consistent 
diagram. Such situations often occur in diagrammatic transformations, where a 
complex step is broken up to produce several intermediate diagrams, of which 
only the start and final diagram belong to the language of interest. The prob- 
lem of deciding whether a diagram produced during the transformation process 
belongs to the language or is just an intermediate diagram can in many cases be 
solved without resorting to a new full analysis of the diagram. In fact, knowing 
that the starting diagram was in the language and knowing the possible transfor- 
mations, we can usually define some simple test for the validity of a transformed 
diagram. For example, among all the multisets produced during the link erasure 
process, all and only those which do not contain any synchronization resource 
represent a Venn diagram. 

In general, a language L can be specified by a triple (Lq, ^*,Lf), where Lq is 
an initial language, =!>* is the reflexive and transitive closure of the yield relation, 
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and Lf is a, final language acting as a filter for sentences produced starting from 
Lq according to =i>*, i.e. L = {s \ 3s' G Lq '■ s' s} n Lf. 

This view was proposed for string languages in [Man98] and independently 
adopted for the diagrammatic case in [BCM99,BPPS00]. This suggests a line of 
attack for typical problems in diagram transformations related to the possible 
production of inconsistent diagrams. In our approach, the filter language can be 
characterized by a set of LOi rules. A valid state in a diagram transformation 
process is one for which there exists an LOi proof of the filter property. 

As an example, consider the dangling edge problem which is typical of graph 
transformation systems. The double-pushout approach to algebraic graph trans- 
formation [CMR+97] faces this problem by not allowing deletion of a node if its 
elimination would leave dangling edges after rule application. From our perspec- 
tive, this could be modelled by giving a simple set of LO\ filter rules. Indeed, 
a proof using the following program terminates successfully (i.e. with 1) only if 
all edges are eliminated. 

edge{Gl) 'S’ node{G2) 'S’ node{G3) 

o— touches{Gl, G2) & touches{Gl,G3) & node{G2) ^node{G3) 
node{3) o— T 
node{-) o— 1 



Again, this “filtering” set of rules is seen as an ex-post check that the ap- 
plication of rules was correct. The implementation of checks to avoid dangling 
edges would require the encoding and maintaining in the nodes of a record of 
the touching edges and a check that all touching edges have been removed be- 
fore removing the node. As seen in Section 3.3, guarded rule application would 
require the use of metalevel representations. 

5 On a Class of Local Transformations 

We have seen that the semantics of graphs defining finite state machines can be 
given in two alternative ways: the first consists of generating a set of rules from 
the analysis of the graph and providing the initial configuration of the process 
(with the string to be analysed in a position corresponding to the initial state); 
the second consists of giving the general law for transitions in the form of a 
set of rules and providing the whole description of the diagram, in the form of 
a multiset of resources together with the initial position of the string. In both 
cases the inference calculus realises the computation. In a sense we could say 
that in the first case we directly compile a concrete diagram into the rule set 
whereas in the second case the rule set represents a general interpreter for such 
diagrams. 

This is possible because a notion of transition results explicitly defined in the 
visual representation, (depicted as arrows in the finite state machines), such that 
the pre-and post-conditions of the transition are directly inferrable from the dia- 
gram. A process corresponds to a sequence of markings of the diagram, according 
to the specifications of the pre- and post-conditions of the fired transitions. 




60 



Paolo Bottoni, Bernd Meyer, and Francesco Paris! Presicce 



In general a visual transformation s can be defined by specifying a law for 
transforming a diagram d into a diagram d' in the same visual language. We 
indicate with pre{d, s) the set of elements in d involved in the precondition for s 
and with post{s, d'), the analogous set of elements involved in the postcondition. 
A couple of partial functions p : pre{d,s) post{s,d') and p' : post{s,d') 
pre{d, s) exist, relating elements in the two diagrams. 

Visual languages providing explicit representation of the transition enjoy 
some properties, that we now consider independently to investigate which is 
the class of languages for which such a dual approach can be taken. To start 
this research we first introduce the property of locality, based on a notion of 
reachability between elements. For the purpose of this discussion we consider only 
connectedness and containment as significant relations between visual elements. 
We say that an element A is reachable from another element B if there is a 
navigation path composed of only relations of connectedness and containment 
leading from B to A. Reachability is here taken as a symmetric relation, in 
the sense that the navigation does not have to take into account the possible 
existence of directions in the connections. This is to say that if A is contained 
into B, we consider that A and B are reachable from one another, and similarly 
if they are connected by a directed edge. If we extend the relation to admit 
that any element is reachable from itself, reachability becomes an equivalence 
relation, as it is obviously transitive. We can now establish a notion of distance 
between elements in the following way. Let P{A, B) be the set of paths between 
A and B.^ 



rK A ~ length{P{A, B)) if reachable{A, B) 

’ oo otherwise 

We can now formalise the notion of explicit representation of a transition as 
follows: given a transition step s, there exist in d exactly one graphical element t, 
a collection of graphical elements P = {pi, . . . ,p„} representing the elements af- 
fected by the transformation and such that reachable{t,pi) holds for i = 1 , . . . , n 
and there exist two collections of graphical elements Mi = {rrii^i , . . . , mi^m} and 
M2 = {w2,i, . . . , rn2,n2}- Then there are two mappings pre : P 1-^ M\ U {blank} 
and post : P M2 U {blank}, such that M\ is a marking for P iff the system 
is able to perform s, and M2 is a marking for P iff the system can have just 
performed s. 

Such a definition appears as a rephrasing of the firing condition for Petri 
nets, but one can see that with the suitable restrictions on P, it occurs also for 
finite state machines. For example, a transition from a state si to a state S2, 
consuming a symbol X from an input A o i? is visualised by considering P = 
{si) S2}, Ml = string(XoR) M 2 = string(R), pre(si) = string(XoR), pre{s2) 
= blank, post{s\) = blank, post{s2) = string(R). 

In languages supporting an explicit representation of transformations, p and 
p' constitute a bijection and pre{d,s) = p'{post{s,d')) = P. Hence a diagram 

^ Essentially d{A, B) is the path length between A and B in the underlying relationship 
graph. 




Visual Multiset Rewriting: Applications to Diagram Parsing and Reasoning 



61 



depicting an initial state may vary only for its markings. This observation may 
be generalised to define a class of local transformations. 

We say that a transformation is local if and only if there exist an integer Ki 
such that for any pair of diagrams d, d' and transition step s transforming d into 
d' , the following hold: 

1. Vx G post{s, d') 3y G pre{d, s) s.t. d{x, p{y)) < K\ 

2. there is an integer Kd such that no sequence of transformation steps may 

affect more than Kd elements. 

In practice, this notion of locality rules out transformations involving sets 
of elements of arbitrary cardinality or at an arbitrary distance from elements 
mentioned in the precondition of a transition. 

Observation 1 If a diagrammatic system supports an explicit representation of 
transitions, then all the transformations realisable on the system are local. 

Reasoning. If transitions are represented explicitly, then for any transition there 
is a set of elements in the pre- or post-conditions that are reachable from the 
element representing the transition. Moreover, since the diagram is modified only 
for its marking, an inspection of the initial diagram provides the constants Ki 
and Kd. 

One can observe, however, that the notion of locality goes beyond that of 
explicit representation of transitions. In particular, it enlists all transformation 
laws that can be expressed in the form of before-after rules, where the left-hand 
side enumerates all the elements in the preconditions and the right-hand side 
all the elements in the postconditions, without any form of side effects. Typical 
examples are Agentsheets [RS95] or Kidsim [SCS94] rules. Such a notion can 
be formalised by constraining p and p' to be injections such that if p'{p{e)) is 
defined for some e G pre{d,s), then p' {p{e)) = e and if p{p' {e')) is defined for 
some e' G post{s,d'), then p{p'{e')) = e'. 

We can now relate the general notion of locality to our linear logic framework. 

Observation 2 If a transformation law is local, then it is possible to translate 
a visual sentence defining a dynamic system into a collection of rules in the 
considered fragment of linear logic, such that any sequence of applications of 
these rules corresponds to a possible dynamics of the system. 

Reasoning. In a local transformation law, the effect of a transition step can be 
described through a rule which presents all elements in the pre-condition in the 
left-hand side, and all elements in the post-condition in the right-hand side. Since 
all elements in the pre-condition must be present for the rule to succeed, we can 
use the connective 'S’ to state this fact. The same reasoning justifies the use of 
'S’ in the right-hand side. Conditions on attributes can be mapped to constraints 
involving elements in the right-hand side, and can be used as conjuncts for the 
connective Since elements in local transformations are finite, a finite set of 
rules is derived. 
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The above suggests the general form of the rules directly derivable from a 
diagram: 

originators{X)o—transf{X, Y) & affected{Y) 

where we have slightly abused the notation, by collecting into the terms orig- 
inators and destinations the (fixed) set of terms respectively in pre{d, s) and 
post{s,d'). The resource trans/ summarises the operators defining the modifica- 
tion of the marking and the check that additional preconditions for the transition 
are met. 

As noted before, the types of transformations considered above have also 
another characteristic: they do not modify the topology of the diagram along 
transformations. 

We say that a visual transformation is topology preserving if for any diagram 
d and transition step s, for any two elements x, y appearing either in D = 
pre{d, s)Ufj,' {post(s, d')) or in D' = post(s, d')Ufj,{pre{d, s)), the relations between 
X and y are the same in D and D' . 

Observation S If a diagrammatic system supports an explicit representation of 
transitions, then it describes a topology preserving transformation. 

Reasoning. Indeed, since transitions are defined once and for all, their application 
can only modify the marking of the diagram, and not the topology. 

A weaker condition is for a transition to be reachability preserving, holding 
if for any diagram d and transition step s, for any two elements x,y G pre{d, s)) 
such that reachable{x,y) holds in d, ii x,y G post{s,d'), then reachable{x,y) 
holds in post{s,d'). In words, if two elements are present in a diagram both 
before and after a transition step, then they have to remain reachable. 

Observation 4 If a transformation law is topology preserving, then it can be 
expressed by a collection of rules in the considered fragment of linear logic. 

Reasoning. In a topology preserving transformation law, the set of elements to 
be checked for any transition can be obtained from navigation along connec- 
tions. The application of the rule requires the identification of the elements to 
be checked in the precondition for any transition, the checking of the precondi- 
tion, the identification from the postcondition of the elements to be transformed, 
and their actual transformation. All these steps can be performed through the 
adopted connectives, namely 'S’ for identifying elements and & for checking con- 
ditions. 

Also in this case, the argument suggests the form of the set of rules repre- 
senting the general transformation law. In this case, the set of elements involved 
in the transformation has to be constructed. 

The first rule starts the process of collecting the originators for a given tran- 
sition Dep, assuming that some trigger occurs from outside and is inserted in 
the multiset and that a resource transf(Dep) is used to represent a transition. 

trigger(Dep) 'S’ transf(Dep) o— transf{Dep, collect, nil) 
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The second rule is applied as long as there are originators to collect. 

transf{Dep, collect, Store) 'S’ originator{X) o— 

include{Dep, X, Store, New Store) & nonComplete{Dep, Store) & 
transf [Dep, colleet, NewStore) 

The third rule evaluates the actual transformation and starts the process of 
distributing the new configuration elements. 

transf{Dep, collect. Store) o— 

complete(Dep, Store) & eval{Store, Result) & transf(Dep, distrib. Result) 



The fourth rule is applied as long as there are results to distribute. 

transf{Dep, distrib. Store) o— 

take{Dep, Store, Y, NewStore) &. nonempty{Store) & 
destinationiy) 'S’ transf{Dep, distrib, NewStore) 



The final rule allows starting the transition again if a trigger is generated. 
transf{Dep, distrib, nil) o— transf(Dep) 

Again, we have used generic resources originator and destination to indicate 
elements which are affected, either in pre- or in post-conditions by the transition. 

The use of transf(Dep) prevents a transition to be started again before the 
completion of the procedure, while allowing concurrent execution of several tran- 
sitions. It might be the case that conflicting transitions are triggered together, 
so that some form of mutual exclusion must be realised. This can be achieved 
by having a global available resource, of which at most one copy may exist in 
the multiset, removed any time a procedure starts and placed back each time 
it finishes. In this way consistent states of the diagram correspond to possible 
states of the system according to an interleaving semantics. 

The procedure here described relies on some suitable reflective encoding of 
the connectedness between elements, i.e. an element must have access to a de- 
scription of its neighbour set. Such an encoding, namely a counting of neigh- 
bours, has been used in the procedure for link erasure in the previous section. 
Such a counting is in general sufficient and can be consistently updated if we can 
overview side effects. For example, if the containment relation is important, then 
rules must be defined which allow removal of containers only if they are empty. 
In general the encoding must be restricted to immediately reachable elements 
and not to transitive closures, so that updating of topological information can 
be performed on a local basis. In this way, we avoid problems connected with 
the representation of spatial relations described in Section 2. 

More general metalevel capabilities are required if rule application depends 
on global properties of the multiset, such as in transformations which can occur 
only if the collection of all instances of a given type enjoys some properties. 
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In this case, a record of the object level resources of type X must be kept in 
metalevel resources of type recX. 

Combining the two conditions above, we obtain the following result: 

Observation 5 If a transformation law is both local and topology preserving, 
then the system dynamics can he equivalently expressed as a set of rules contain- 
ing at least an instantiated attribute for each term and an initial multiset, or as 
a set of rules containing only free variables and an initial multiset. 

For systems such as FSA, Petri Nets or dataflow diagrams, the set of origina- 
tors and destination elements coincide. For the case of graph-like diagrams, the 
locality requirement, and dually the topology preserving one, rules out trans- 
formations which involve the updating of arbitrary embeddings. Basically, the 
requirements establish a sort of context-freeness property. This is not meant to 
say that our embedding only accounts for such context-free transformations, but 
that the duality between the two representations is only possible in this case. 

These patterns of rule sets are definable not only for graph-like diagrams, but 
also for other types of diagrams. For example, consider a diagrammatic language 
in which atomic elements are located into container elements and transforma- 
tions may affect only elements within a same container (an example of such a 
system was described in [BBMP96]). In this case the requirements for context- 
freeness are satisfied only if the allowed transformations are restricted to the 
following types: 

1. Elements can be individually created within or removed from a container 

2. Transformations involve a fixed set of elements 

3. Containers are created empty 

4. Containers can be removed only if they are empty. 

6 Conclusions 

We have shown how diagram transformation can be formalized in linear logic 
and we have discussed interpretations in multiset rewriting. Many important 
kinds of diagrammatic reasoning, which can be understood as syntactic diagram 
transformation, can be formalized in this way. 

We have discussed a small fragment of linear logic that is expressive enough 
to model diagrammatic transformations, yet small enough to directly correspond 
to a calculus of linear logic programming. Our formalism therefore is a directly 
executable specification language. We have also proven equivalence of our model 
with attributed multiset grammar approaches and correctness of the correspond- 
ing mapping. 

The paper also described how to manage a restricted form of negative context 
conditions checking the non-existence of certain resources or ensuring exhaus- 
tive rule application. Such conditions cannot be directly stated on the resources 
present in the multiset, but require the use of metalevel resources maintaining a 
record of the object-level resources. 
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Ultimately, we are interested in specification languages for diagram notations 
in which the rules themselves are visual. The idea is that this can be formalized 
by an additional mapping between linear logic rules and visual rules. Such an 
approach necessarily raises the question if and when visual rules are adequate 
to describe a transformation system. We hope that the ability to formalize the 
transformation as well as the embedding conditions and the underlying geometric 
theory within the unifying framework of linear logic will allow us to develop 
formal criteria that help to answer this important question. 
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Appendix A: Linear Seqnent Calcnlus 



This appendix shows the relevant rules of the sequent calculus presented 
in [HP94]. 



ih 



(ax) 



rh6.A r, (5 b A 



r, r h A, A 



(cut) 
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Abstract. We propose a characterization of a Chomsky-like hierarchy 
of multiset grammars in terms of multiset automata. We also present the 
deterministic variants of all the automata investigated and prove that, 
unlike the string case, most of them are strictly less powerful than the 
nondeterministic variants. Some open problems and further directions of 
research are briefly discussed. 



1 Introduction 

In [8], a formalism based on multiset rewriting, called constraint multiset gram- 
mar, is introduced with the aim of providing a high-level framework for the def- 
inition of visual languages. They have been investigated in [11] from the parsing 
complexity of visual languages point of view. These grammars may be viewed 
as a bridge between the usual string grammars, dealing with string rewriting, 
and constraint logic programs, dealing with set rewriting. Other devices based on 
multiset rewriting have been reported in [16,17] where abstract rewriting multiset 
systems were used for modelling some features of the population development 
in artificial cell systems. Furthermore, a Chomsky-like hierarchy of grammars 
rewriting multisets was proposed in [9] . In all these approaces, the multisets are 
processed in a sequential, non-distributed way. 

On the other hand, other authors have considered parallel or distributed 
ways of processing multisets for defining computational models inspired from 
biochemistry [1,2,3] or microbiology [12,13,14]. Different properties of commuta- 
tive languages (macrosets in our terminology) have been considered in a series 
of papers, see, e.g., [5,10] and their references. 
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entifica, SB 97-00110508 
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We address in this paper the accepting problem of multisets by means of 
some devices similar to automata from the formal language theory. 

We shall assume the reader familiar with the fundamental concepts of formal 
language theory and automata theory; all notions we use can be found in [15]. 
We denote by [l..n] the set of natural numbers 1,2, ...,n. For a finite set A 
we denote by card{A) the cardinality of A. An alphabet is always a finite set 
whose elements are called symbols. A string over an alphabet 17 is a mapping 
s : [l..n] — > V , for some n which is called the length of the string s, denoted by 
jsj. By convention, we shall consider also the empty string of length 0 which is 
written e. Moreover, for each a G V we set jsja = card{{i \ s{i) = a}). The set 
of all strings over an alphabet V is denoted by V* (the free monoid generated 
by V with the concatenation operation); moreover, V~^ = V* — {e}. A subset of 
V* is called language. For an alphabet V = {oi, 02, . . . , Ofc}, the Parikh mapping 
associated with V is a homomorphism ipv from V* into the monoid of vector 
addition on N^, defined by 'd^vi.s) = (jsjai , |s|a 2 > • • • ? Isja*,); moreover, given a 
language L over V, we define tpv{L) = \ x G L}. 

A finite multiset over an alphabet P is a mapping cr : V — > N; a(a) expresses 
the number of copies of the symbol a in the multiset a. The empty multiset is 
denoted by e, that is e(a) = 0 for all a G V. The set of all multisets over V is 
denoted by V° while the set of all non-empty multisets is denoted by P^. A 
subset of P° is called maeroset. The weight of a multiset ct is || ct ||= ^(®)> 

and II a \\u= Ylaeu subset U of P. 

One can naturally associate with each multiset cr the finite language = 
{w I |w|o = cr(a) for all a G V}, and conversely, with each string w G V* one 
can associate the multiset pLw ■ V — > N defined by Hw{a) = |w|a- Clearly, the 
language formed by the empty string only is associated with the empty multiset 
and the empty multiset is associated with the empty string. Therefore, each 
multiset a over P can be represented, in a one-to-one manner, by the vector 
<Fy(Tcr), therefore, we do not distinguish between macrosets over an alphabet of 
cardinality k and sets of vectors from N*. 

Normally, we use lower case letters from the end of the Latin alphabet for 
strings and small Greek letters for multisets. Furthermore, the Greek letter pL is 
used with an index only, for the multiset associated with the string denoted by 
the index. 

For two multisets a, r over the same alphabet P we define: 

— the inclusion relation ct C t iff cr(a) < T(a) for all a G P; 

— the addition multiset ct + t defined by ct + r(a) = ct(o) + r(a) for all a G P; 

— the difference multiset a — t defined by (ct — r)(a) = a(a) — r(a) for each 

a G P, provided that t C ct. 

2 Multiset Grammars 

We recall from [9] the definition of a multiset grammar. A multiset grammar 
is a construct G = {N,T, S, P), where N,T are disjoint alphabets, referred as 
the nonterminal and the terminal one, respectively. S' is a finite macroset over 
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Nut, called the starting macroset, P is a finite set of multiset rewriting rules 
of the form a ^ t , where a, r are multisets over N UT and ^ 1- 

For two multisets ai ,02 over iV U T, we write a\ «2 if there exists a 
multiset rule r : a ^ t G P such that a C a\ and 02 = <ai — <t + r. We denote 
by the reflexive and transitive closure of the relation The macroset 
generated by G is defined by 

Gen(G) = {a&T*\S =^* a}. 

Following [9], a natural Chomsky-like classification of such grammars is given 
below: 

1. Grammars G as above (without any restriction on the multiset rules) are 
said to be arbitrary. 

2. If li (T ||<|| r II for all rules cr ^ t in P, then G is said to be monotone. 

3. If II (T 11= 1 for all rules a ^ t in P, then G is said to be context-free. 

4. If II cr 11= I, II T ||< 2, and || r ||at< 1 for all rules a ^ t in P, then G is said 

to be regular. 

We denote by mARB, mMON, mCF, mREG the families of macrosets gener- 
ated by arbitrary, monotone, context-free, and regular multiset grammars, re- 
spectively. By REG, CE, MON, ARB we denote the families of regular, context- 
free, context-sensitive, and recursively enumerable languages, respectively. For a 
family of languages P, we denote by PsF the family of Parikh sets of vectors as- 
sociated with the languages in F. The family of all semilinear sets is denoted by 
SLin while the family of all linear sets is denoted by Lin. Furthermore, mFIN 
stands for the class of all finite macrosets while PsFIN stands for the class of 
all finite Parikh sets of vectors. 

Following [9], the aforementioned classes of macrosets are related as in the 
diagram from Figure 1, where a pair of arrows indicates a strict inclusion of the 
lower class into the upper class, and a single arrow indicates an inclusion for 
which the properness is open. 

The proper inclusions mMON C PsMON and mARB C PsARB appear to 
be of interest. The first relation is no longer unexpected if we remember that all 
variants of scattered context grammars generate only context-sensitive languages 
[7,4,6]. 

3 Multiset Finite Automata 

Roughly speaking, a multiset finite automaton consists of an input bag in which 
a multiset is placed, and a detecting head which can detect whether or not a 
given symbol appears in the bag (there is at least one copy of that symbol in the 
multiset). The automaton works as follows: it starts in the initial state having a 
multiset in its bag and changes its current state depending on the former state 
and the detection of a symbol in the bag. If a symbol has been detected, it 
is automatically removed from the bag. The automaton stops when no further 
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PsARB 




mARB PsMON 




mMON 



niREG = mCF = PsREG = PsGE = SLin 



niEIN = PsEIN 

Fig. 1 . 



move is possible or the bag is empty. If the bag is empty and the current state 
is a final state, then the automaton accepts the input multiset. In all the other 
situations, the initial multiset is rejected. 

Formally, a multiset finite automaton (MFA, for short) is a structure 

A={Q,VJ,qo,E), 

where Q is a finite non-empty set of states, V is the input alphabet, qo is the 
initial state, E is the set of final states, and / is the transition mapping 

f :QxV — > 2 ^. 

A configuration of A is a pair {q,r), where g is a state (the current state) and 
T is a multiset (the current content of the bag). We define the following relation 
on the set of all configurations: 

(<7, r) h (s, p) iff there exists a £ V such that s £ f{q, a) and t — p = Pa- 

The reflexive and transitive closure of this operation is denoted by h*. The 
macroset accepted by A is defined by 

Rec{A) = {t I (qo,T) h* (g,e) for some q £ F}. 

Otherwise stated, the macroset accepted by the finite automaton A is the set of 
all multisets r for which the automaton A starting in q^ and having r in its bag 
reaches a final state with an empty bag. 

Example 1 We consider the multiset finite automaton A = {{q[),qi,q2\ , 

90; {90}); where the transition mapping f is defined as in the next diagram: 
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t 



a 




c 



h 




Fig. 2. 



It is easy to notice that Rec{A) = {r | r(a) = r(6) = r(c)}. 

The well-known proof for showing the equivalence of finite automata and regular 
string grammars, with respect to their computational power, can be easily han- 
dled in order to prove the equivalence of multiset finite automata and multiset 
regular grammars with respect to the same criterion. We denote by C{MFA) 
the family of all macrosets accepted by multiset finite automata. 

Proposition 1 C{MF A) = mREG = mCF = PsREG = PsCE = SLin. 

A MFA A = {Q, V, /, qoj P) is said to be deterministic (DMFA for short) if the 
following two conditions are satisfied: 

(i) card{f{q,a)) < 1 for all g G Q and a GV. 

(ii) For each state q, if card{f{q, a)) yf 0, then card{f{q, b)) = 0 for all b ^ a. 

The equivalence relation between nondeterministic and deterministic finite 
automata on strings naturally suggests the same question regarding the DMFAs 
and MFAs. For it is easy to notice that there exist finite macrosets which cannot 
be accepted by any DMFA, the answer is immediate. However, the next result 
states more, namely 

Proposition 2 1. L{DMF A) is incomparable with Lin and the family of finite 
macrosets, with respect to the set inclusion. 2. C(DME A) C L{MEA). 

Proof. Clearly, the second assertion follows immediately from the first one. We 
shall prove only the relation between C{DMP A) and Lin. To this aim, we 
consider the linear set L = {x(l,2) -|- y(2, 1) \ x,y G N}. Assume that L is 
accepted by a DMFA A = {Q, {a, b}, /, qo, P). Since both multisets (2n-|-4, n-|-5) 
and (2n -|- 4, 4n -I- 2), for a given arbitrary large n, belong to L, it follows that 
there exist two final states gi , ?2 G Q such that 

(go, (2n + 4, 4n + 2)) h* (gi, (0, 3n - 3)) h* {q^, (0, 0)). 

Because there are repeated states in the computation (gi,(0,3n— 3)) h* (g2,(0,0)), 
one infers that vectors of the form (2n -I- 4, m), with arbitrary large m, have to 
be in L, a contradiction. 

Conversely, the DMFA defined by the transition mapping 

g(so, a) = si, g(si, a) = S2, g(s2, b) = S3, 5(53, c) = si, 

having the initial state sq and the final states si and S3, accepts the macroset 
{(n -I- 1, n, n) | n > 0} U {{n + 2,n + l,n) | n > 0} 
which is not linear, and the proof is complete. 
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We now consider a slight extension of multiset finite automata in the following 
way. This automaton, called multiset finite automata with detection (MFAD, for 
short) is able to change its state not only when some symbols are detected in its 
bag but also when some symbols do not appear in the bag. More precisely, an 
MFAD is a structure A = {Q,V, f,qo, F), where all parameters are defined as 
for a MFA, excepting the transition mapping which is defined onQx(yuy)} 
into Q. By V we mean an alphabet containing all barred copies of the symbols 
in V. A move defined by the function / applied to a pair (q,a), a € is the 
same move defined by the transition function of a MFA applied to the same 
pair, while a move defined by the function / applied to a pair (g, a), a G V, may 
determine the automaton to enter a state from f{q, d) providing that no copy of 
a is in the bag. More precisely. 



if and only if 



(q,r) h (s,p) 



(i) there exists a G V such that s G f{q, a) and t — p = pa, or 

(ii) there exists a gV such that s G f{q, d), r(a) = 0 and p= t. 

A MFAD A = {Q,V, f,qo, F) is said to be deterministic (DMFAD for short) if 
the following two conditions are satisfied: 



(i) card{f{q, x)) < 1 for all q € Q and x GV \JV. 

(ii) For each state q and a G V , card{f{q,a)) yf 0 or card{f{q,d)) ^ 0, then 

card{f{q, y)) = 0 for all y G {V \ {a}) U (t^ \ {d}). 



This extension does not induce any increase of the computational power of 
nondeterministic multiset finite automata but it does it for the deterministic 
variants. Denoting by £{MFAD) and £{DMF AD) the families of macrosets 
accepted by MFADs and DMFADs, respectively, we state: 

Proposition 3 1. £{MFA) = £{MFAD). 2. £{DMFAD) contains all finite 
macrosets. 3. Lin and £{DMFAD) are incomparable with respect to the set 
inclusion. 



Proof. 1. Let A = {Q, V, /, qo, F) be a MFAD; we assume that V = {oi, 02 , . . . , 
a„} for some positive integer n. Construct the regular multiset grammar G = (iV, 
V, S, P), where 

N = {{q,x) \ q G Q,x GV*, \x\ < n, \x\ai < G [l..n]}, 
s = {h-iqo.x) \ q G Q,x GV*,\x\ < n, |x|a, < 1, i G [l..n]}, 

and P contains all rules defined as follows: 

— For each transition q G f{s,a), add the rules P(s,x) La{q,x) for ah nonter- 
minals (s,x), (q,x) G N, providing \x\a = 0. 

— For each transition q G /(s,d), add the rules P(^s,x) h‘{q,y)^ for all nonter- 

minals (s,x), (q,x) G N, where y = xa, if \x\a = 0, or y = x, otherwise. 

— For each final state q G F, add the rules P{q,x) for all nonterminals 

(q,x) G N. 
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Let 






, h* . . . h* . . . , fc^-'))) h* 

■•,^i’”^)) = (s™,(0,0,...,0)) 

be a computation in A, which accepts the multiset {kf^\k^\ . . . , kn'^), such that 
the following conditions are satisfied: 

(i) qi e f{si,aji)Ji e G 1]. 



{qi,{k[^\k^2\.. 

{q2,{kf\k‘'^ \ . . 






(ii) Each computation (qi, , kn^)) b* (s^+i, (ki~'~^\k 

kn~^^^)), i G [0..TO— 1], contains no transition based on checking the presence 
of a symbol in the current bag. 

One can construct the following derivation in G: 



A* 



di+l) I,(*+l) 



Po 



m 



Pi 



m 



■ Pm—l 



Vm+lj 



where 



Po{{qo,e)) = 1 , 

Pi{{si,yi)) = 1, where yi is the shortest string containing 
all symbols Qj^,aj^, . . . ,aj^_^,r]i{ap) = k^'> — k^^\p G G 

Pi{{qi,yi)) = 1, where yi is the shortest string containing 
all symbols aj^ aj^, pi{ap) = k^^^ — k^p\p G i G [l..m — 1], 

Pm+i{ap) = kf\p G 

Note that each symbol in N U E which does not appear in the definition of some 
multiset from above has no occurrence in the respective multiset. 

Therefore, G can generate all the multisets accepted by A. The converse 
inclusion follows in the same way, and this concludes the first assertion. 

2. Let L = ri 2 \ • ■ . , rifc | * G [L.p]} be an arbitrary finite macroset 

over the alphabet V = {oi, 02 , . . . , akj, having its vectors listed in the lexico- 
graphical order. We prove by induction on the cardinality of V that L belongs to 
L{DMF AD). If A: = 1, then a DMFAD which accepts L is presented in Figure 3. 




Fig. 3. 



We assume that 



,(i) 



= n 



( 2 ) 



= . . . = n 



(ii) 



< n 



bi+i) 






(* 2 ) 



< . . . < n 



= n 



its) 



= n 



ip) 



(is — 1 + 1) 
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By the induction hypothesis, for each i G let At = (Qi, F\{ai}, Fi) 

be the DMFAD which accepts the language 

Lu = \ j G [ti_i + 

where to = 0- Moreover, the sets Qi are pairwise disjoint. 

We construct the DMFAD 

^ = (U|=iQ* U {go, 9i, ■ • ■ , 9„(p) }, V, Qo, /, ^t=lF^) 

‘‘'1 

as shown in the next figure. 





Fig. 4. 



More formally, the transition mapping is defined by 

f{qj,ai) = qj+i,j G [0..^^^ - 1], 

/(g (tp,ai) = q^o^\j G [l..s], 

f{q,ai) = fj{q,ai),l G [2..fc], provided that g G Qj. 

Cleary, the automaton is deterministic and it accepts the given macroset L. 

3. The linear macroset considered in the proof of Proposition 2 cannot be rec- 
ognized by any DMFAD - a similar reasoning to that used in the aforementioned 
proof is valid - which completes the proof. 



4 Multiset Linear Bounded Automata 

A multiset linear bounded automaton has a bag able to store a finite multiset 
and a read-write head which can pick up (read) a symbol from the bag and add 
at most one symbol to the contents of the bag. Adding a symbol is meant as 
increasing the number of copies of that symbol by 1 . Being in a state, the multiset 
linear bounded automaton reads a symbol which occurs in the bag, changes its 
state and adds or not a symbol to the bag contents. When the machine stops 
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(no move is possible anymore) in a final state with an empty bag, we say that 
the input multiset is accepted, otherwise it is rejected. 

Formally, a multiset linear hounded automaton (shortly, MLBA) is a construct 



M = {Q,V,U,f,qo,F), 

where Q and F are the finite sets of states and final states, respectively, V and 
U are the input and the bag alphabets, respectively, V C U, qo is the initial 
state, and / is the transition mapping from Q x U into the set of all subsets of 
Q X {U U {e}). As in the case of MFAs, a configuration is a pair {q, r), where q 
is the current state and r is the content of the bag. We write 

{Q,r) h {s,p) 

if and only if there exists a G V such that 

(i) (s, b) e f{q, a), 6 yf e, r(a) > 1, p{a) = r(a) - 1, p{b) = r(6) + 1, 
and p{c) = t(c) for all c G F, c ^ {a, b}, or 
(ii) (s, e) G f{q, a),T(a) > 1, p(a) = r(a) - 1, 
and p(c) = r(c) for all c G V, c yf a. 

The reflexive and transitive closure of this operation is denoted by ^*. The 
macroset accepted by A is defined by 

Rec{M) = {t I (goi'r) |=* for some q G F}. 

The deterministic MLB As are defined as usual. In [9], a result corresponding to 
the workspace theorem in formal language theory was proved for multisets. 

For an arbitrary multiset grammar G = (N, T, A, P) , let 

D : ai 02 Qffe 

be a derivation with ai G A. The workspace of the derivation D, denoted 
by WS{D), is the maximal weight of the multisets appearing in D, namely 
max{|| ai ||| i G [l..fc]}. Then, for j3 G Gen{G) we put 

WSaifd) = inf{IFS'(i:i) \ D : a f3,aG A}. 

We say that G has a linearly bounded workspace if there is a constant k such 
that for each non-empty (3 G Gen{G) we have WSg{P) < k\P\. 

Theorem 1 (Workspace Theorem) The class of macrosets generated by gram- 
mars with linearly bounded workspace equals the class of macrosets generated by 
monotone grammars. 

Based on this result, it is easy to prove the next result. Note that the second 
assertion, which can be easily proved, corresponds to one of the most important 
open problems in the formal language theory. 
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Proposition 4 1. mMON equals the class of macrosets accepted by MLB As. 

2. Deterministic MLBAs are strictly weaker than MLBAs. 

Proof. Let G = {N, T, A, P) be a monotone grammar; without loss of generality 
(see [9]), we may assume that A contains only one multiset iis over N U T 
(remember that S has to be a nonterminal and p.s is the multiset associated 
with the string S) and f^s is not a submultiset of any righthand side of the rules 
in P. We construct the MLBA A = {Q, T,N UT, f, qq, F), where 

Q = { 90 , <Z/} U {(/?, a) I there exists a rule p ^ t G P such that 
(3 and a are submultisets of r and ij, respectively} 

and the transition mapping / is defined as follows: 

f{qo, a) = f{{e,e),a) = {((/3, a),b) \ there exists a rule r] ^ t G P 
such that r(a) — /3(a) = 77 ( 6 ) — a{b) = 1 and 
r(c) — /3(c) = 77(c) — a(c) = 0 for all c € {N UT)\ {a, b, S'}}, 
for all aG{NUT)\ {S} 

/((e,e),S) = {(g/,e)} 

/((/3,a),a) = {{{P',a'),b) \ /3(a) - /3'(a) = a{b) -a'{b) = 1 and 

(3{c) — !3'{c) = a{c) — a'{c) = 0 for all c € {N U T) \ {a, &, S}}, 
for all a e (A^ U T) \ {S} 

/((/3,e),a) = {((/3',£),e) | /3(a) - /3'(a) = 1 and /3(c) - /3'(c) = 0 
for all c G {N U T) \ {a, b, S}}, 
for all a G (A^ U T) \ {S}. 

The set of final states contains the state qf, if the empty multiset is generated 
by G, then we add qo to the set of final states. 

It is easy to infer that, on each input multiset, the automaton simulates the 
application of the rules of G in the reverse order - the righthand side of a rule, 
which is identified in the bag, is replaced by the lefthand side of that rule - and 
accepts (enters g/) if and only if the contents of the bag is exactly 775 . Since G 
is monotone, the weight of the bag at any step is bounded by the weight of the 
input multiset, hence the automaton is linearly bounded. 

Conversely, let A = {Q, V, U, f, qo, F) be a MLBA; we construct the arbitrary 
multiset grammar G = {N, V, qis, P), where N = {S'} U M U W U Q. We denote 
by V the set |a' | a G M}. The set of multiset productions P consists of the 
following rules: 

1. p,s tiaa'S for all a G C, 

2. p,s ti-aa'qo for aU aev, 

3. Mga' ^ Hsb' if (s,b) G f{q,a), 

4. p,qa' /is if (s,e) G f{q,a), 

5. p,q ^ e if q € F . 
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The above grammar works as follows. In the first phase, by using the rules 
from the first two groups, itgenerates a multiset having just one copy of Qq and 
the same number of copies of a and a' for all a G V, respectively. Roughly 
speaking, we may say that the sentential form comprises two multisets, of equal 
weight, one of them being the primed copy of the other. The grammar now 
simulates the automaton A on the primed multiset. If the automaton accepts, 
the final state is erased from the sentential form and the result of this derivation 
is the non-primed multiset. Thus, a multiset over V is generated by G if and 
only if A accepts that multiset. We conclude the proof by mentioning that the 
workspace used by G for any derivation generating a multiset p is bounded by 
2 II p II +1, hence G has a linearly bounded workspace. 

2. The finite macroset r over the binary alphabet {a, 6}, containing the two 
multisets p,a and pb, cannot be accepted by any DM LB A. 



Remember that the extension considered in the previous section had no effect 
on the computational power of nondeterministic MFAs. The situation changes 
when considering the same extension for MLB As. By [9], all the macrosets over 
one letter alphabet in mMON are semilinear. The next example provides a de- 
terministic MLBAD (multiset linear bounded automaton with detection) which 
is able to recognize the non-semilinear macroset {2” | n > 1}. We list the transi- 
tion mapping only and indicate the set of final states, the other parameters can 
be deduced easily. 



Example 2 



/(go, a) = {(gi,e)} f{qi,a) = {(go,^)} 
f{qo,a) = {(<72, e)} /(gi,a) = |(gr,e)} 
f{qr, b) = |(ge, e)} f{qr, b) = |(g/, e)} 
f{q2,b) = {(g3,e)} f{q3,b) = {(g2,a)} 
f{q2,b) = {(<7o,e)} f{q3,b) = {(<7{,e)} 
/(g;,a) = |(ge,e)} /(g;,a) = |(g/,e)} 



The set of final states consists of the state qf only. One can easily check that 
the automaton iteratively halves the number of copies of a or b as long as this is 
possible. Then, it checks whether or not the previous odd number of copies was 
one, in which case it accepts, or reject, otherwise. 



Clearly, from the previous example it immediately follows that 

Proposition 5 1. MLB As are strictly weaker than MLBADs. 2. Deterministic 
MLBAs are strictly weaker than deterministic MLBADs. 



In order to get more information concerning the accepting power of ML- 
BADs, we consider here multiset random context grammars. The definition of 
a multiset random context grammar is a direct extension of the definition of 
a string random context grammar, see [15]. We mention that the multiset ran- 
dom context grammars contains only non-erasing context-free rules. The class of 
macrosets accepted by multiset random context grammars is denoted by mRC. 



Proposition 6 Every macroset in mRC is accepted by an MLBAD. 
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Proof. Let G = (N,T, fis^ P) be a multiset random context grammar. We con- 
struct the MLB AD A= {Q,T,TU N U N' , f,qo, {qf}), where N' is the alphabet 
containing the primed copies of all the letters in N, 

Q = {qo,qf}U{[X,Y,Z,p]\X,Y,Z CTUN}U 

{[X,Y,Z,p]\X,Y,Z cTUN,pCr for some a ^ t € P}. 

and the transition mapping / is defined as follows: 

— For each rule {P, F; a ^ t) G P we have 

- {[P,P,P,T - Pa],e) G /(go, a), a G TUfV, if || t ||> 2, 

- {[P, P, F],A) e /(go, a), if t = pa and a = pA, 

- ([P, P,F,p- pa],e) G /([P, P, P, p], a), a G T U iV, if II p ||> 2, 

- {[P, P, P],A) G /([P, P, P, p|, a), if p = Pa and a = pA-- 

- For each a G X we have 

* f{[X,Y,Z],a) = {([A\{a},r,Z|,a')}, 

*/([0,A,y|,a') = {([0,A\{a},y|,a)}, 

*/([0,0,A|,a) = {([0,0,A\{a}|,e)}, 

for all X,Y,Z CTUN. 

- /([0,0,0|,a) = {(go, a)} 

~ f{qo,S) = |(g/,e)}. 

It is rather easy to observe that 

Ps =^* p if and only if (go,p) h* (?/,£) 

for any multiset p over N U T. It follows immediately that Gen{G) = Rec{A) 
holds which concludes the proof. 

We point out here some open problems: 

1 . What is the relation between mARB and the family of macrosets accepted 
by MLBADs? 

2. The same question for PsMON and the family of macrosets accepted by 
MLBADs. 

3. Are deterministic MLBADs strictly weaker than MLBADs? 

4. What is the relation between the macroset classes recognized by determin- 
istic MLBADs and MLBAs, respectively? 

5 Multiset Turing Machines 

Multiset Turing machines, unlike MLBAs, have a bag able to store an infinite 
multiset and a read-write head which can pick up a symbol from the bag and 
add at most one symbol to the content of the bag. The working mode is defined 
exactly as for MLBAs. Such a machine accepts a multiset when it enters a final 
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state having in its bag an infinite number of B's and nothing else. Let us denote 
by P the multiset P{B) = oo and /3(a) = 0 for any other symbol a. 

Formally, a multiset Turing machine (shortly, MTM) is a construct 

M = (Q,V,U, f, go, B,F), 

where Q and F are the finite sets of states and final states, respectively, V and U 
are the input and the bag alphabets, respectively, V C U, go is the initial state, 
i? is a special symbol which occurs infinitely many times in the bag, and / is the 
transition mapping from QxU into the set of all subsets of Q x (([/U{e})\{i3}). 
A configuration is a pair (g, r), where g is the current state and t is the content 
of the bag. We write 

{Q,r) h {s,p) 

if and only if there exists a G V such that 

(z) (s, b) e f{g, a),b^e, r(a) > 1, p{a) = r(a) - 1, p{b) = r(6) + 1, 
and p{c) = t(c) for all c € F, c ^ {a, b}, or 
(a) (s,e) G f{g,a),T{a) > l,p{a) = r(a) - l,p{b) = T{b) + 1, 
and p{c) = t{c) for all c G V, c yf a. 

The reflexive and transitive closure of this operation is denoted by ^*. The 
macroset accepted by A is defined by 

Rec{M) = {t I ((7o,t) \=* {q,P) for some g G F}. 

The proof of the next result is left to the reader who can be successful by 
using the proof of Proposition 4 with some minor changes. 

Proposition 7 1. mARB egual the accepting power of MTMs. 2. Deterministic 
MTMs are strictly weaker than MTMs. 

The same extension defined in the previous two sections can be considered also 
for MTMs. Thus, one gets a modified version of a register machine. One can 
prove that: 

Proposition 8 1. MTMDs are stronger than MTMs. 2. PsARB eguals the class 
of macrosets accepted by MTMDs. 

Proof. The first statement is a consequence of the second one which follows from 
the fact that MTMDs are equivalent to register machines. 

It is not known whether or not deterministic MTMs are strictly weaker 
than deterministic MTMDs, neither whether deterministic MTMDs are strictly 
weaker than MTMDs. 
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6 Final Remarks and Further Directions of Research 

We have considered some variants of multiset automata whose working mode is 
based on the multiset addition and subtraction. These automata turned out to 
be suitable for an automata characterization of the Chomsky-like hierarchy of 
multiset grammars introduced in [9]. 

Several interesting problems listed in the previous sections remain to be 
solved. Besides them, there are lots of problems concerning multiset automata, 
coming from different possible directions of further research. We briefly discuss 
below some of them. 

We start with a remark which could make an essential distinction among 
MLBADs as well as MTMDs: In the definition of an MLBAD, for instance, 
there is no restriction when checking the existence of a given symbol in its bag; 
it may add or not a symbol to the content of the bag. It appears natural to 
consider that whenever it checks the existence of a symbol in its bag it cannot 
add anything to the bag. Does this restriction diminish the computational power 
of these variants? 

It seems to be of interest to apply the program we used in this paper to 
other types of automata: multi-head automata, multi-tape automata, contraction 
automata, etc. 

Another attractive direction of investigation is to consider the complexity of 
the automata introduced in this paper. This study may develop in two directions: 
descriptional complexity of these automata with respect to different measures 
(states, transitions, etc.) and computational complexity of them with respect to 
time/space resources they need. 
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Abstract. The Parikh mapping maps each word over an alphabet with 
n letters to an n-dimensional vector whose components give the number 
of occurrences of the letters in the word. We consider the Parikh images of 
sequences and languages obtained by iterated applications of morphisms 
(or sets of substitutions). Furthermore we modify the Parikh mapping 
such that it can be iterated and study the corresponding sequences. 



1 Introduction 

Let V be an alphabet. By #(V) and V* we denote the cardinality of V and the 
set of all words over V including the empty word A, respectively. We consider a 
fixed order of the elements Oi, 02 , . . . , a„ of IL and define the Parikh mapping^ 
Pv-.V* ^ N” by 



Pv{w) = (#ai(w),#a 2 (w),---,#o„(w)), 

where, for a word w G V* and a letter a € V, we denote the number of occur- 
rences of a in w by #a{w) and N denotes the set of non-negative integers. If V 
is clear from the context, we write p instead of pv- 

This mapping has also a meaning in multiset theory. Instead of considering a 
word w, i.e., an ordered sequence over V, we regard the unordered version, i.e., 
the multiset consisting of ^a{w) elements a. 

In this paper we consider combinations of the Parikh mapping and the well- 
known concept of iteration. There are at least two possibilities for this combina- 
tion. 

1 . The first one is to regard an iteration process on a word leading to a sequence 
of words and to look for the Parikh images of these words. The simplest 
iteration process is the iterated application of a morphism. We mention that 
this idea has - besides its mathematical attraction - also some biological 
interest (for details we refer to [8]). We shall investigate this approach in 
Section 2. 

2. The second one consists in an iteration of the Parikh mapping itself. In order 
to ensure this we have to consider the vector py{w) as a word. This restricts 

^ Named after R. J. Parikh 
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the alphabet; the elements of V have to be digits in order to represent the 
numbers of occurrences and the comma and the brackets. However, since the 
number of occurrences of the comma equals n — 1 and each bracket occurs 
exactly once, we modify the Parikh mapping using numbers as delimiters 
between the numbers of occurrences. In Section 3 we shall give the detailed 
definitions and shall give some results on the sequences obtained in this way. 

2 Parikh Sequences and Parikh Languages 
Obtained by Iterating Functions 

2.1 DOL Systems 

A DOL system^ G is a triple G = {V, h, w) where 

— P is an alphabet, 

— is a morphism which maps V* into V*, and 

— ru is a non-empty word over V. 

By the iterated application of ft. to w we obtain an infinite sequence 
S{G) = {w, h{w), h^{w), h^{w), ...) 



and a language 

L(G) = {ft*H I i > 0} . 

Here we are interested in the Parikh images of S{G) and L{G), i.e., in the 
sequence 

PS{G) = {p{w), p{h{w)), p{h^{w)), p{h^{w)), ... ) 

and the set 

PL{G) = {p{h\w))\i>0}. 

To illustrate the concept we consider the following example. 

Example 1. Let G = (V,h,w) be the DOL system with V = {a,b,c,d}, w = c 
and ft given by 

ft(a) = a, ft(&) = ab, h{c) = bed, h{d) = da . 

Then we obtain 

ft(c) = bed, h^{c) = h{bed) = abbedda, ft^(c) = aababbcddadaa 
and, in general, for i > I, 

ft*(c) = . . . aababbcddadaa . . . . 

^ This is the abbreviation of deterministic (D) interactionless (0) Lindenmayer (L) 
system. We refer to [8] and [11] for a detailed discussion of Lindenmayer systems 
including their biological motivation. 
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Therefore, for z > 1, 






p{W{c)) = (2 • ^ j, z, 1, z) 
i=i 



(z(z - 1), i, 1, z) . 



Hence we get 



P5(G) = ( (0,0,l,0),(0,l,l,l),(2,2,l,2),(6,3,l,3),...,(z(z-l),z,l,z),...) 



and 

PL{G) = {(0,0, 1,0)} U |(z(z - l),z, l,z) I z > 1} . 

Let G = {V, h, w) be a DOL system with V = (oi, 02 , • ■ • Onj- For two letters 
a and b of V and z G V~^, the number of occurrences of a in h{z) generated by 
one occurrence of 6 in z is 4j^a{h{h)). Therefore 

#„(/z(2)) = ^#,(z)-#a(/z(6)). 

bev 

We now define the growth matrix Mq associated with G as the n x n matrix 
(zzzfcj) where mk,j = Then 

p{h{z)) =p{z)Ma 



and, for z > 0, we can prove by induction 

p{h\w)) =p{w) ■ 

We now discuss the status of decidability of the following decision problems: 

— Parikh membership problem: Given a vector v G N" and a DOL system 
G = {V, h, w) with #(H) = n, decide whether or not v G PL{G). 

— Parikh finiteness problem: Given a DOL system G, decide whether or not 
PL{G) is finite. 

— Parikh sequence equivalence problem: Given two DOL systems G and G', 
decide whether or not PS{G) = PS{G'). 

— Parikh language equivalence problem: Given two DOL systems G and G', 
decide whether or not PL{G) = PL{G'). 



Theorem 1. The following problems are decidable for DOL systems: 

i) the Parikh membership problem, 

ii) the Parikh finiteness problem, 

Hi) the Parikh sequence equivalence problem, and 
iv) the Parikh language equivalence problem. 

Proof. We only prove ii) and iii). i) can be shown by analogous considerations. 
For a proof of iv) we refer to [9]. Moreover, we give proofs based on the Parikh 
vectors. Using the fact that a set of Parikh vectors is finite if and only if the 
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corresponding set of words is finite, one can give another proof for ii) based on 
the decidability of the finiteness of the word set. 

ii) It is well-known that any infinite set of vectors of N” contains at least 
two vectors x and y such that x < y or y < x, where the order on N" is defined 
componentwise (see [1]). 

Now let G = (V,h,w) be a DOL system with #(F) = n. Then we can 
determine two integers i and j such that i < j and p{K^{w)) and p{h^{w)) are 
comparable with respect to the partioal order on N". 

Case 1. p{h\w)) < p{h^{w)). Then 

p{C^^{w)) — p{y^^{w)) = p{C{w))Mg — p{y {w))Mg 

= {p{C{w)) - p{k> { w)))Mg 
> 0 

(0 denotes the zero-vector where all components are 0). This implies p{h^'^^{w))< 
p{C'^^{w)) and, by induction, we obtain 

p{h^^'^{w)) < p{C^'"{w)) for r > 0 . 

Therefore the set of maximal vectors of PL{G) is contained in the initial part 
{p{w) , p{h{w)) , p{h'^ (w)) , . . . ,p{h^{w))}. Thus, PL{G) is finite. 

Case 2. p{h^{w)) < p{h^{w)). Then, as above, we can show that 

p{h^^'^{w)) > p{K^^'"{w)) for r > 0 . (1) 

Now we consider the sequence 

<?o =p{h\w)) =p{w)M}j, 

qi = p{h^{w)) = p{h'‘^^^~^\w)) = = qoMlf\ 

q2 = p{h^+^^^-^Hw)) = = qiMiC\ 

Q3 = p{h^+^^^-^Hw)) = qoM§^-^^ = 



etc. By (1), 



Moreover, we set 



9o > <7i > 92 > 93 > • • ■ ■ 



do = qi- qo, 

di = q2 - qi = - qoM^(f" = 

= 93 - 92 = 92A^g~* “ qi^G~" = diM^G~'‘ = do{M^G~")'^^ 

c^3 = 94 - 93 = d2M^(f" = do{M^(f"Y, 



( 2 ) 



etc. 

We now prove that PL{G) is finite if and only if = 0 for some k. 
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First let us assume that PL{G) is finite. By qi G PS{G) for / > 0 and (2), 
there is a number k such that = qk+i- This implies = 0. 

On the other hand, let dk = 0 for some k. Then dr = dk{M^(P^Y~^ = 0 for 
r > k. Hence qk = qk+i = <?fc +2 = ... or equivalently 

q_r = <lk for r > k (3) 

We now have by definition, known properties of matrix multiplication and finally 

( 3 ) 

= {dr ~ dk)MQ = OMq = 0. 



This implies 

= p(h^+Pi-^)+\w)) for r > A: and 1 < t < (j - i) 

and 

PL{G) = {p{w),p{h{w)),p{hYw )), . . . ,p(/r*+'=0-*)+0--i)} . 

The last statement proves the finiteness of PL{G). 

Obviously, the sequence {do,di,d 2 , ■ ■ ■) is generated by the DOL system H = 
(V, v) where u is a word with p{v) = do (the growth matrix of is Mq~^). 
As above, there are integers F and j' with i' < f and comparable vectors di' 
and dj'. Again, if dji < di>, then 

{do , di , ^2 , . . ■} {do, di , . . . , dj/_|_^j/_ } 

and we can easily test whether this set contains the zero- vector 0. If di> < dj>, 
then we get d^+r < dj>+r for r > 0, and dfc = 0 for some k implies dk' = 0 for 
some k' < j' + {j' — i' — 1), i.e., we can check whether dk = 0 holds for some k. 

iii) Let G = (V,h,w) and H = (V,g,v) be the two given DOL systems. 
Further let n = #(fo). We prove that 

PS{G) = PS{H) if and only if p{hd{w)) = p{g'^{v)) for 0 < f < n. 

Obviously, this criterion is effective and gives an algorithm to decide the Parikh 
sequence equivalence. 

By definition, PS{G) = PS{P[) implies p{h^{w)) = p{g'‘{v)) for any t > 0. 
Therefore p{h^{w)) = p{g^{v)) holds for 0 < f < n. 

Let us now assume that p{h^{w)) = p{g''{v)) for 0 < i < n. We determine 
the smallest r such that the vector p{K'{w)) is linearly dependent on the vectors 
p{w),p{h{w)),p{h?{w)), . . . ,p{h^~^{w)). Since all Parikh vectors are elements of 
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the n-dimensional vector space Z", we have r < n. We now show that, for s > r, 
there are a^GZ, 0<z<r — 1 such that 



P{h"{w)) = ^a,p{h\w)) = ^aip{g\v)) =p(/(n)) 



i=0 



i=0 



i.e., p{h‘^{w)) is a linear combination of p{w),p{h{w)),p{h?{w)), . . . ,p{h^~^{w)), 
p{h'^{w)) = p{g'^{v)), and p{g^{v)) can be represented as a linear combination 
using the same coefficients. We use induction on s. For s = r the statement 
follows by the choice of r and the supposition. The induction step follows from 
the following relations: 

= p{h“{w))MG 

= I ^ aip(/i*(w)) ) Mg (by induction assumption) 

\i=0 / 

r— 1 r— 1 

= '^a,p{h"{w))MG = 

i—0 i—0 

r-2 

= ar-ip{h'^{w)) + ^ aip{h''~^^{w)) 



i^O 

r-2 



= ^ a,p{h^+^{w)) 



2=0 



2=0 



(by induction assumption for r) 



= f3op{w) + + a^-l)p{h''{w)) 



2=1 

r—1 



= PoP{v) + + a^-i)p{g\v)) 

2=1 

(by p{h^{w)) = p{g^{v)) for 0 < j < r) 

r—1 

— ^ aip{g^^^ (v)) (do the above steps backwards) 



2=0 

r—1 



^r— 1 



= ^0'tP{g\v))MH = ^a^p{g\v)) Mh 



2=0 



^2 = 0 



= p{g^{v))MH (by induction assumption) 

= p(5®+^(?;)) 



2.2 Some Generalizations 

In the preceding subsection we considered DOL systems where a language is 
generated by iterated applications of a morphism to a word. In order to generalize 
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this concept one can use another mapping or several morphisms or a combination 
of both these ideas. Thus we get a TOL system which is formally defined as 
follows. 

A TOL system is a triple G = {V, P, w) where V and w are specified as in a 
DOL system and P = {ti,T 2 , ■ ■ ■ , Tr} is a set of n finite substitutions n : V* ^ 
V*, 1 < i < r. The language L{G) generated by G is defined as 

L{G) = ■ . in^iw)) ...)) \ m > 0,1 <ij <r,l <j <m}. 

The associated Parikh language PL{G) is given by 

PL{G) = {p{v) I V G L{G)} . 

We say that a TOL system G = (P, {ti, T 2 , . . . , Tr}, w) is a 
~ DTOL system, if is a morphism for 1 < t < r, 

- OL system, if r = 1. 

Then, a DOL system satisfies r = 1 and Ti is a morphism which gives precisely 
our definition above. 

A TOL system is called propagating or A-free, or shortly PTOL system, if 
A ^ Ti(V) for 1 < i < r. Adding this restriction to TOL, DTOL, OL and DOL 
systems, we obtain PTOL, PDTOL, POL and PDOL systems, respectively. 

Example 2. We consider the PDTOL system G = ({a, 6}, {/ii, / 12 }, a) where the 
morphisms are given by h\{a) = a? , h\{b) = b, / 12 (a) = b^, h 2 {h) = b. Obviously, 
iterating hi we get all words with i > 0. If we apply /12 then we obtain a 
word b^'^ and this word is not changed by further application of one of the 
morphisms. Therefore 

L(G) = I z > 0} U I z > 0} 

and 

PL{G) = {{2\ 0) I z > 0} U {(0, 3 • 2*) I / > 0} . 

Now we show that PL{G) is not the Parikh language of some OL system. Assume 
that PL{G) = PL{H) for some OL system H = {{a,b},T,v). Obviously, if z G 
r(a), then p{z) = (a;,0) or p{z) = (0,y) for some non-negative integers x and 
z/. If X = 0 or y = 0, then we can generate from a word G L{Pl) (note 

that (2*,0) G PL{G) = PL{Pl)) the empty word A. Thus A G L{P[) but (0,0) ^ 
PL{H) . Analogously, we can show that p{z') yf (0,0) for any z' G r(6) . Therefore, 
V = a has to hold. If p{T{a)) C N x {0}, then we cannot generate any word with 
a Parikh vector in {0} x N. Hence we have (x,0), (0,z/) G r(a) for some x > I 
and z/ > 1. Thus we can generate the word a^b^ from G L{P[) (since (2, 0) G 
PL{G) = PL{H)). Therefore p{a^b^) = (x,y) G PL{H), but (x,y) ^ PL{G) in 
contradiction to PL{H) = PL{G). 

Example 3. We now consider the OL system G' = ({a}, r, a} with r(a) = {a, a^}. 
Then it is easy to see that 

L{G') = {a^ \j>l} and PL(G') = {(j) | J > 1} • 
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VC{TQL) 




PC{DT0L) PC{0L) VC{PTQL) 



* 





VC(DQL) VC{PDT0L) VC{PQL) 




VC{PDQL) 



Fig. 1. Hierarchy of the Parikh language families of L systems. VC[X) VCiY) 
means the proper inclusion PC{X) C PC{Y), and if two families are not connected, 
then they are incomparable. 



We prove that PL(G') cannot be generated by some DTOL system. Assume 
that PL{G') = PL(H') for some OL system H' = ({a}, {hi, h- 2 , . . . hn}, v'). Since 
the underlying alphabet only consists of one letter a, any morphism hi is given 
by hi{a) = a^' for 1 < f < n. Without loss of generality, we can assume that 
fci yf 1 for 1 < z < n since the identity does not change the words. Moreover, 
fcj yf 0 for 1 < z < n since we have (0) ^ PL{P[') and therefore A ^ L(H'). Thus 
any application of a morphism hi strongly increases the length. This implies 
v' = a. Then 

L{H') = I (Zj > 0, 1 < j < rz} 

and 

PL{H') = {{kfkf . . . e) U, > 0, 1 < j < n} . 

Now let p be a prime number such that p > ki for 1 < z < n. Obviously, 
(p) ^ PL(H') in contrast to (p) G PL(G'). 

For A G {DOL, OL, DTOL, TOL, PDOL, POL, PDTOL, PTOL}, we denote the 
family of Parikh languages generated by X systems by VC{X). 

By definition, we obtain VC{PX) C VC{X) for A G [DOL, OL, DTOL, TOL} 
and VC{YT0L) C VC{Y0L) for Y G {A, P, D, PD}. Moreover, by the Examples 
2 and 3 we have the incomparabilty of V C{DT0L) and VC{0L). Without proof 
we summarize the relations between the families of Parikh languages generated 
by different types of OL systems. For a complete proof we refer to [2]. 

Theorem 2. The diagram of Figure 1 holds. 

With a morphism h we can associate a relation Rh by 

{w, v) G Rh if and only if v = h{w) . 
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Further, the iteration of the morphism h corresponds to the reflexive and transi- 
tive closure of the relation R^. Thus the language L{G) generated by a DOL 
system G = (V,h,w) is the set {z \ (w,z) € R^j- Analogously, we can redeflne 
the languages obtained by iteration of a substitution or a set of morphisms or 
substitutions. This leads to the general concept. 

A language generating device is a triple G = (V, R, w) where V and w are 
specified as in a DOL system and i? is a binary relation on V*. The language 
and the Parikh language generated by G are defined as 

L{G) = {z I {w, z) G R*} and PL{G) = {p{z) \ z G L{G)} , 
respectively. 

There is not too much knowledge on Parikh languages of language generating 
devices. We here only mention some results on context-free grammars which are 
a special language generating device deflned as follows. 

A context-free grammar is a quadruple G = (N, T, P, S) where N and T are 
disjoint finite non-empty sets, P is a finite subset of x {N U T)* and S is 
an element of N. We define a relation on {N U T)* by {w,w') G=^ if and 
only if the following conditions are satisfied: w = uAu' , w' = uvu' for some 
words u,u' G {N U T)* and (A,u) G P. As usual, we write w w' instead of 
{w,w') G=^. Then the language and Parikh language of G are deflned as 

L{G) = {z\zGT*,S z} and PL{G) = {p{z) \ z G L{G)} , 
respectively. 

Example 4- Let 

G = ({S', A, B}, (o, b, c}, |(S, AB), {A, aAb), (A, ab), (P, aBc), {B, ac)}, S) . 

Since from A we produce pairs of letters a and b and from B pairs of a and c, 
we get 

L(G) = |a"6”a'"c™ | n > l,m > 1} 

and 

PL{G) = {{n m,n,m) | n > 1, m > 1} . 

In order to characterize the Parikh languages of context-free grammars we 
need the following notions. 

A set M C N" is called linear, if there are a natural number k and vectors 
Xq,Xi, . . . ,Xk such that 

k 

M = {xo -I- ^ aiXi I Oi G N, 1 < f < fc} . 

i=l 

M is called semilinear iff it is a finite union of linear sets. 

For instance, PL{G) of Example 4 is a linear set since 

PL{G) = 1(2, 1, 1) -k n(l, 1, 0) -k m(l, 0, 1) | n > 0, m > 0} . 

Due to R. J. Parikh (see [10]), the following characterization of context-free 
Parikh languages is known. 
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Theorem 3. i) For any context-free grammar G, PL{G) is a semilinear set. 

ii) For any semilinear set M, there is a context-free grammar G such that 
M = PL{G). 

Proof. For a proof of i) we refer to [12]. However, we note that the proof is 
constructive, i.e., it determines the vectors which describe the linear sets whose 
union is PL{G). 

Let G and G' be two context-free grammars. Then there is a context-free 
grammar H such that L{F[) = L{G) U L{G') (see [12]). Moreover, 

PL{H) = p{L{H)) = p{L{G)UL{G')) = p{L{G))Up{L{G')) = PL{G)UPL{G') . 

Therefore it is sufficient to prove that, for any linear set M there is a context-free 
grammar G with PL{G) = M. 

Let 

k 

M = {xo -b ctiXi I Oi G N, 1 < z < A:} C N” 

i=l 

for some vectors xq,xi, . . . ,Xk. Further, let T be an alphabet with #(T) = n 
and, for 0 < z < fc, let G T* be a word with priwi) = Xi. Then we construct 
the context-free grammar 



fe-i 

G = {{Ai \ l<i<k},T, {Ak Wk, Ak zco}U (J {Ai Wi, Ai Ai+i}, Ai) . 

i=l 



Then we get 



Ai wiAi wiWiAi . . . 

w"f^W2A2 W2W2A2 



. . . A 2 



W^^W2^W3A3 W^^W2^W3W3A3 



rii no nq 
Wn W 






ni^^n2^,n3 



gj2 bc/g -rl3 k W-^ ^2 

-vl’^Ak 

W'{^W'f"W 3 n 3 . . . = z . 



A 3 



w'f^W2^W3n3. . .w'f’Ak 



W-i Wn 






Since 

Pt{z) = X 3 + nixi + U2X2 . . . UkXk , 
it is easy to prove that M = PL{G). 

We mention the following decidability properties. 

Theorem 4. The following problems are decidable for eontext-free grammars: 

i) the Parikh membership problem, 

ii) the Parikh finiteness problem, 

Hi) the Parikh language equivalence problem. 
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Proof, i) Given G we first construct the semilinear set M . To check whether or 
not a given vector belongs to M it is sufficient to check whether or not it belongs 
to one of the linear sets whose union is M. The last check can be done by solving 
a system of linear equations. 

ii) PL{G) is finite if and only if L{G) is finite. The finiteness of context-free 
languages is decidable (see [12]). 

iii) We have to check the equality of two semilinear sets which can be done 
algorithmically (see [5]). 

For further information on semilinear sets (in connection with languages) we 
refer, for example, to [5], [6] and [7]. 

3 Iteration of a Modified Parikh Mapping 

In this section we study the iteration of a modified version of the Parikh map- 
ping. The modification has to ensure that the image is contained in V*, again. 
Therefore the numbers of occurrences have to be considered as words which can 
be done by considering gr-adic representations of the numbers. 

We consider the alphabet 

V, = {0,l,2,...,q-1}. 

Then the representation w(m, q) of the integer m in the base g, g > 2, is a word 
over Vq and we define a{w{m, q)) as the string 

w{#o{w{m, q)),q)Qw{#i{w{m, q)),q)l... w(#g_i(w(m, q)),q){q - 1) 

(we count, in the base q, the number of occurrences of the digits in the word 
w{m, q), and we used (g — 1) instead of g — 1 in order to ensure that the digit 
g — 1 is meant). In the sequel we shall use the notation 

■u;(#o(w(m, g)), g)0w(#i(w(m, g)), g)l . . . g)),g) g- 1 

where the digits used as delimiters are underlined. 

The Parikh mapping of w(m, q) gives 

pv,{w{m, g)) = ((#o(w(m, q),ffi{w{m, g)), . . . , #q-i{w{m, g))) , 

i.e., essentially, we use the comma instead of the delimiters and the numbers of 
occurrences instead of the representations of the numbers of occurrences. 

The function a can be iterated, and, for given m, p as above, we obtain the 
sequence a^{w{m, g)), j = 0, 1, 2, ..., of strings over the alphabet Vq. The set of 
all these strings will be denoted by L{m, q). 

Example 5. We take the base 2 and start with m = 140. Therefore, 



w{m,q) = 10001100. 




96 



Jurgen Dassow 



We have 5 occurrences of 0 and 3 occurrences of 1, hence 



a(w(140, 2)) = w{5, 2) 0 w{3, 2) 1 



10 10 111 . 



Now, 



a^{w{U0, 2)) = w{2, 2) 0 w{5, 2) 1 
a3(w(140, 2)) = u>(3, 2) 0 w(4, 2) 1 
a^(w(140, 2)) = u>(3, 2) 0 w(4, 2) 1 



10 0 10 11 , 

110 10 0 1 , 

110 10 0 1 . 



Since a^(w(140,2)) = o;‘*(w(140, 2)), we obtain 



a*(w(140,2)) = 1 1 0 1 0 0 1 for i > 3 



and 

L(140,2) = {10001100,1010111,1001011,1101001}. 

In our example L(140, 2) is a finite language. We shall prove that this holds 
for all m and q as above. 

Theorem 5. For all positive m and for all q >2, the language L{m, q) is finite. 

Proof. In a{w{m,q)) some substrings w{ffi{w(rn,q)),q), 0 < i < q — 1, can 
be equal to 0 (no symbol i appears in w{m,q)), but in a{w{m, q)) each digit 
0, l,...,(p — 1) appears at least once and similarly in all subsequent strings 
{w{m,q)), j > 2. Thus, in 

a^+^{w{m,q)) = w{ffo{aHw{m,q))),q) 0 ... w{ffq-i{a\w{m,p))),q) q-1 

all the substrings w{ffi{a^{w{m,p))),q), 0 < i < q — 1, correspond to non-null 
numbers. 

We now compare the length of the strings (w(rn,q)) and a^'^^{w{m,q)) 
for j > 2. Assume that {w{m,q)) contains ti occurrences of the digit i, i G 
{0,1,..., (g- 1)}. Then 



q-1 

\a^{w{m,q))\ = 

i=0 

q-1 

^+^{w{m,q))\ = E \w{ti,q)\ + q. 
2 = 0 



Clearly, 

\w{U,q) \ = [log^ti] -h 1, 



hence 



9-1 



\a^+^{w{m,q))\ = [log, ti] +2q. 



2=0 



(4) 
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Trivially, [log^ ti] < ti, but for s > we have 

log, s <s-2q 

(indeed, if r > 1 and s = rp^, then log, rq^ = log, r + 3 < r{q^ — 1) — 2q + r, 
because log, r < r and 3 < r{q^ — l) — 2q for all g > 2, r > 1). We now consider 
a{w{m,q)) with the length strictly greater than q^. At least one of the above 
considered numbers ti, 0 < t < p — 1, is strictly greater than q^ (the pigeon 
principle), hence for such a number ti we have 

[log, U] <U-2q. 

By (4), this implies 



p-i 

\a^+^{w{m,q))\ <^t, = \a^w{m,q))\. 

i=0 

In conclusion, starting from a{w{m, q)) of some length greater than the length 
of q)), for j = 2, 3, ... decreases strictly, until obtaining a string of length 

smaller than g^. 

Conversely, having a short string a{w{m, q)), the length of a^{w{m,q)), j = 
2,3,... can increase several steps, but each such increase is at most 2g, the 
constant term in relation (4). Therefore, the length can increase at most until 
g'^ + 2g, when it must again decrease. 

In conclusion, the language L(jn, q) contains only strings of length at most g‘*+2g, 
plus possibly a finite number of strings of greater length, when starting from a 
string of greater length. Hence this language is finite. 



By this proof we know that, for words such that for some i, 1 < f < g — 1, the 
number ti of occurrences of i in w{m, g) is larger than g^, the length is decreased 
by a. We now want to improve this bound. 

Let g > 3, and let us assume that 3q < ti < q^ for some i. Then 



[ti] < 3 < g = 3g — 2g < ti — 2g. 

By (4), it follows that \a{w{m,q)) \ < |'u;(m, g)|. 

By arguments analogous to those in the proof of Theorem 5, we obtain 3g • 
g + 2g = 3g^ + 2g as a lower bound for the length of words whose length is 
decreased by a. 

By the finiteness of L{m,q), the problem of the structure of the sequence 
(w{m,q)), j > 2, arises. Two situations can appear: 

— there is a string {w{m,q)) with a^~^^{w{m,q)) = {w{m,q)) for fc > 1 

(we call such a string a^{w{m,q)) singleton-attractor) 

or 

— there are strings q)), s = l,2,...,k, such that a^+^(w(m, g)) = 

a^{w{m, q)) and all strings g)), s = 0, 1, 2, ..., fc — 1, are pairwise 

different 

(we say that a cycle- attractor appears). 
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Example 5 shows that a^(w(140,2)) = 1101001 is a singleton-attractor. 

Example 6. Let now q = 3 and m = 2189. Then we obtain 

z = w(2189,3) = 10000002, 
a{z) = 2001112, 
a‘^{z) = 2010122, 
a^(z) = 2021102, 
a^(z) = 1021102, 
a^{z) = 2010122 = a2(^). 

Therefore we have a cycle-attractor which consists of three elements. 

How many singleton-attractors and cycle-attractors exist for a given base pi 

For a given base p, are there both singleton- and cycle-attractors? Are there 

cycles of arbitrary large size? We have only partial answers to these questions. 

Theorem 6. For each base q >2, there is at least one singleton- attractor. 

Proof. For q = 2, hy our Example 5, the statement holds. 

For <7 > 3, we consider the string 

z = 10111221314... 1 p-2 1 p-1 . 

Obviously, a{z) = z. 



Theorem 7. For each base q>S, there is at least one cycle- attractor (consist- 
ing of two strings). 

Proof. Consider the strings 

= 10 (p- 3) 142131415... 1 p-3 2 p-2 1 p-1 , 

Z2 = 10 (p- 2) 122132415 ... 2 p-3 1 p-2 1 p-1 . 

It is easy to see that a(zi) = Z 2 and a(z 2 ) = zi. Hence we have a cycle. 

For <7 = 8, by this construction, we obtain the cycle 

105142131415261 7, 
106122132425161 7. 

By the proof of Theorem 5, we obtain that any attractor contains a word z 
such that, for 1 < i < (7 — 1, 






8 for g = 2 
3(7 for q>3. 



The following theorem improves this relation. 
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Theorem 8. Let q >6. Let z be in some attractor. Then 

{ 2 i = 0 
q + 3 1=1 
, , = 2 ■ 
q — I i > 3 

For a proof of Theorem 8 we refer to [3] 

Theorem 9. For any q > 4, any attractor contains a string z with #o{z) = 1. 

Proof. Since we are interested in elements of attractors we can assume that z is 
obtained by application of a, i.e., z can be represented in the form 

The number 0 < i < g — 1, is called the i-th coefficient of z. Moreover, 
ffk{z) is the fc-th coefficient of a{z). 

By Theorem 8, it is sufficient to show that there is no attractor with = 2 
for all elements z of the attractor. 

Let us assume that such an attractor A exists and let z be an element of A. 
Then one coefficient of a~^(z) contains a zero. By Theorem 8, this coefficient has 
to be q and has to be the coefficient of 1 or 2. By assumption, these relations also 
hold for a“^(z). Moreover, q — 1 occurrences of 1 or 2 exist in the coefficients of 
a“^(z) depending on (a“^(z))^^^ = q or {a~^{z)Y‘^^ = q, respectively. By these 
facts, a“^(z) is one of the following four elements: 

20101222324 . . . 2 g-2 2 g- 1 , 

20211022324 . . . 2 g-2 2 g- 1 , 

20101121314 . . . l g-2 1 g- 1 , 

20111021314 . . . l g-2 1 g- 1 . 

It is easy to compute that in all four cases (a(z))^°^ = 1. 

Theorem 8 and Theorem 9 give the possibility to determine by the computer 
all single attractors and all cycle-attractors for small values. The result of our 
computation of single-attractors is given in the table of Figure 2. 

Generalizing the single attractors which we found for q G {7,8,9} it is easy 
to see that, for q>7, there is the single attractor 

10 (P - 3) 132231415... 1 (p-4) 2 (p - 3) 1 (p - 2) 1 (p - 1) 

For q G {2,4,5}, there is no cycle-attractor. For q = 3, there is only the cycle- 
attractor consisting of the elements 

10 0 11 1 1 2, 2 0 12 1 1 2, 1 0 10 1 10 2, 

which we found already in Example 6. For q = 6, there is exactly one cycle- 
attractor, namely 
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Fig. 2. List of all single-attractors for 2 < q < 9. 



10314213241 5, 10412223241 5, 

and for q = 7 we found the only cycle-attractor 

1041421314251 6, 1051221334151 6, 1051222314251 6. 

For p = 8 and p = 9 there exist only the cycle-attractors of Theorem 7. 

Results on further modified functions can be found in [4] . 
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Abstract. Multisets are the fundamental data structure of P systems. 
In this paper we relate P systems with the language and theory for 
multisets presented in [9.] This allows us, on the one hand, to define and 
implement P systems using multiset constraints in a constraint logic pro- 
gramming framework, and, on the other hand, to define and implement 
constraint solving procedures used to test multiset constraint satisfiabil- 
ity in terms of P systems with active membranes. While the former can 
be exploited to provide a precise formulation of a P system, as well as 
a working implementation of it, based on a first-order theory, the latter 
provides a way to obtain a P system for a given problem (in particular, 
NP problems) starting from a rather natural encoding of its solution in 
terms of multiset constraints. 



1 Introduction 

The notion of multiset has been recently re-discovered, analyzed, and employed 
in various areas of Logic and Computer Science. From the point of view of Com- 
puter Science, a multiset is nothing but a data structure that can contain ele- 
ments; differently from sets, the number of occurrences of each element is taken 
into account and differently from lists, the order of the elements does not matter. 
This peculiarity makes them suitable for modeling phenomena where resources 
are generated or consumed. For instance, multisets are used to model linear logic 
proofs, where tokens are consumed to deduce new ones [24]. Multisets are the 
fundamental data structure of a number of computational frameworks, such as 
the Gamma coordination language [3], the Chemical Abstract Machine [4], and 
P systems modeling membrane computing [21]. In all these three approaches, 
which have been developed with different aims, multisets are viewed as entities 

* Partially supported by MURST project Certificazione automatica di programmi me- 
diante interpretazione astratta. 
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containing data (possibly, other multisets), and programs are, basically, collec- 
tions of multiset rewriting rules that can be applied with an high degree of 
parallelism. 

Efforts have also recently been put forward for introducing multisets as first- 
class citizens in programming languages. Their high level of abstraction makes 
it natural to introduce multisets into declarative programming paradigms using 
constraint (logic) programming languages (e.g., [2,9]). In order to allow the user 
to write declarative executable code, the interpreter of the language should be 
able to find the solutions of (i.e., solving) the formulae (i.e., the constraints) 
written by the user to describe the problem at hand. The correctness of such a 
solver must be proved on the basis of the properties that characterize multisets. 
Thus, it is important to develop a simple (first-order) theory of multisets that 
forces the desired behavior of the multiset data structure. If the solver works 
correctly on a minimal theory, then it will work correctly in any consistent 
extension of this theory. In [9] a minimal theory (called Bag) of multisets is 
developed parametrically together with minimal theories for lists, compact lists, 
and sets, and a solution to the multiset unification problem is provided (see 
also [8]). In [10] the results are extended to more complex constraints and the 
constraint logic programming language CLP {BAG), able to deal with this kind 
of constraints, is presented. 

This paper aims at establishing a connection between P systems and the 
constraint language of [9] based on multiset processing as the common denomi- 
nator of the two proposals. The approach adopted is, as usual, to show how to 
define one formalism in terms of the other one and vice versa. Precisely, first 
we show how to define P systems using the language and the theory for multi- 
sets presented in [9]. This provides a precise formulation of P systems inside a 
first-order theory. Moreover, it allows us to define a translation from P systems 
into CLP{BAG) programs that can be used as a simple, executable — though 
sequential — implementation of P systems. 

In defining this mapping we will exploit the flexible multiset processing facil- 
ities offered by our multiset constraint solver, in particular multiset unification. 
This kind of “semantic” unification can be used not only as a means to compare 
two multisets, but also as an easy way to arbitrarily select a subset of elements 
satisfying some property from a given multiset. The possibility offered by our 
constraint language to deal with also partially specified multisets turns out to be 
essential to allow a concise definition of P systems inside our logic framework. 

On the opposite side, we show how to implement part of the constraint solver 
of CLP {BAG) — namely, the fundamental part dealing with equality constraints 
(hence, multiset unification) — using P systems. The extension of this imple- 
mentation to the whole constraint solver of CLP {BAG) is trivial. Thus, once 
we have a description of the problem expressed as a multiset constraint over 
B-Bag we also have immediately a corresponding implementation as & P system. 
This constitutes an interesting viable alternative to obtain P systems algorithms 
for all those problems (typically, NP-complete problems) which admit a rather 
“natural” formulation based on multiset constraints. 




Multiset Constraints and P Systems 



105 



The paper is organized as follows. First we review the above-mentioned multi- 
set language, the axiomatic theory Bag, the constraints allowed in that language, 
and the constraint logic programming language CLP(BAG) (Section 2). Then, 
in Section 3 we show how to use the multiset constraint language to define P 
systems and in Section 4 we use the CLP(BAG) language to provide a simple 
implementation of P systems. In Section 5 we show, instead, how both the stan- 
dard unification and the multiset unification algorithms can be encoded using P 
systems. Finally, some conclusions are drawn in Section 6. 



2 Multiset Theory, Constraints, and Language 

2.1 The Theory Bag 

The first-order language Lsag we consider here is based on a signature E = 
{E Bag, n Bag) and & denumerable set V of logical variables. pBag is a set of 
constant and function symbols containing the constant symbol nil, the binary 
multiset constructor {[ • | • ]}, plus possibly other (free) constant and function sym- 
bols. IlBag = “€”} is the set of predicate symbols. 

The distinguishing properties of multisets can be precisely stated by the first- 
order axiomatic theory Bag, defined parametrically in [9]. Bag is a hybrid theory. 
the objects it deals with are built out of interpreted as well as uninterpreted 
symbols. In particular, multisets may contain uninterpreted Herbrand terms 
as well as other multisets. A multiset can be built starting from any ground 
uninterpreted Herbrand term — called the kernel of the data structure — and then 
adding to this term the other elements that compose the multiset. We refer to 
this kind of data structures as colored hybrid multisets. 

We use simple syntactic conventions and notations for terms denoting mul- 
tisets. In particular, the multiset {[ si | {[ S 2 | • • • {[ Sn 1 1 ]}•••} ]} will be denoted 
by {[ Si, . . . , s„ 1 1 ]} or simply by {[ Si, . . . , s„ } when t is nil (i.e., the empty mul- 
tiset). We will use capital letters for variables, small letters for constant and 
function symbols. 

Example 1 (CBag terms). 

1. {[a, 6, a]} (i.e., {[ a | O I {[ ® I } ]}) is a term in CBag which, intuitively, 
denotes the multiset containing two occurrences of the element a and one 
occurrence of the element b; 

2. {[ a I A } is a term in CBag which, intuitively, denotes any multiset containing 
at least one occurrence of the element a (in other words, it denotes a partially 
specified multiset); 

3- {[{[Q]}){[^]}]}isa>C Bag term which, intuitively, denotes a multiset containing 
two nested singleton multisets. 

4. {[a, 6|c]}, where a,b,c are terms built from constant symbols is a multiset 
containing (one occurrence of) a and b and colored by the kernel c. We also 
say that ker({a, & | c]}) = c. 
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f,g £ F Bag , f ^ g 



m,n > 0 



Fig. 1. The theory Bag 



The axiomatic theory Bag is shown in Fig. 1. For a detailed discussion, see [9]. 

We focus here only on the meaning of axiom (if™). The behavior of the mul- 
tiset constructor symbol {[ • | • ]} is regulated by the following equational axiom 

(F;™) 'ixyz\x,y\z} = \y,x\z} 



which, intuitively, states that the order of elements in a multiset is immate- 
rial {permutativity property). This axiom forces syntactically different terms to 
possibly represent the same multiset. However, axiom (if™) lacks in a criterion 
for establishing disequalities between multisets, even if it is considered together 
with KWF{F2F3. Therefore, axiom (if™) is introduced in [9] to model (if™) and 
characterizing the multiset extensionality property: Two (hybrid) multisets are 
equal if and only if they have the same number of occurrences of each element, 
regardless of their order. 

2.2 The Constraints 

A multiset constraint is a conjunction of atomic formulae or negation of atomic 
formulae in Csag, that is, literals of the form t\ = t2,t\ yf t2,ti G t2,ti ^ t2, 
where ti is a first-order term in Csag- 

Example 2 (Lsag constraints). 

1. X G \a,h\Y} AX AX ^ a-, 

2. {[ {[h, F, o]}, {[o, 0} ]} = {[ VF, O]} A Fi G W A Y2 G VF A Yi yf Y2 (remember 
that capital letters denote variables, while small letters represent constant 
symbols); 

3. {[e,e,e,e,e]} = {[X|ii]} A X ^ {els'}. 

Let if Bog be the equational theory consisting of the equational axiom (if™). 
The standard model BAG for the theory Bag is defined in the following way: 
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— Let T(J^Bag) be the set of first-order ground terms built from symbols in 
^Bag (be., the ordinary Herbrand Universe). 

— The domain of the model is the quotient T{TBag)/ =EBag of T{TBag) over 
the smallest congruence relation =EBag induced by the equational theory 
EBag on T{TBag)- 

— The interpretation of a term t is its equivalence class [ t ] . 

~ = is interpreted as the identity on the domain T(TBag)l =EBag- 

— The interpretation of membership is the following: [ t ] G [ s ] is true if and 
only if there is a term in [s] of the form {[t | r]} for some term r. 

This model is a very important one. As a matter of fact in [10] we prove 
that if C is a constraint, its satisfiability (unsatisfiability) in BAG implies that it 
will be satisfiable (resp., unsatisfiable) in all the models of Bag {correspondence 
property). 

The notion of satisfiability of a formula in a model A is based on the notion 
of valuation. A valuation ct is a function from a subset of V to the domain A of 
A, extended to terms and formulas as usual, a is said to be a successful valuation 
of a formula (p if cr{ip) = true. Thanks to the correspondence property here we 
can safely restrict ourselves to consider the satisfiability in the model BAG- 

Example 3 (Valuations). 

1. Consider the constraints of Example 2. The first two are satisfiable. In partic- 
ular, any successful valuation for constraint 1 needs to map A to 6 (actually 
to the equivalence class [ b ] containing b. We admit this slight abuse of no- 
tation), while any successful valuation for constraint 2 needs to map W to 
{[/i, /i,o} (water) and O to {[o, o} (oxygen). The third constraint, instead, 
is unsatisfiable. 

2. Consider the constraint X = -^Y, Z} Aa ^ Y A nil ^ Y Ab ^ Z. A successful 
valuation for it is A i-^ {[ a, 6|}-, F ^ b, Z ^ a. 

The technique used in [10] to check satisfiability of multiset constraints is 
based on a constraint solving procedure which is able to reduce non-determ- 
inistically any given constraint C to a simplified special form — called the solved 
form — if and only if C is satisfiable (otherwise, C is reduced to false). 

A constraint C is in solved form if all its literals are in one of the following 
forms: 

— X = t and A does not occur neither in t nor elsewhere in C 

— A yf t and A does not occur in t 

— t ^ X and A does not occur in t. 

For instance, the constraint of Example 3.2 is in solved form. If C is a CBag 
constraint in solved form, then C is satisfiable in BAG. A solved form constraint 
therefore can be seen as a finite representation of the possible infinite set of its 
solutions. From a solved form constraint C it is possible to assign terms to the 
variables occurring in it in order to obtain a solution. Moreover, all the solutions 
of C can be obtained in this way. 

Solved form constraints can be obtained via suitable rewriting procedures. 
In Section 5 we will show how to implement some of them into P systems. 
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Example 4 (3-SAT). Consider the instance of 3-SAT: 

{Xi V V X3) A {^Xi V V X3) A {Xi V Xa V ^^3) 

It can be translated into the following Lsag constraint: 

{[{[Xi,ri},{[Xa,y2]},{[^3,i^3]}, 

{Xi, Fa, X3 ]}, {[ Fi, ^2, ^3 }, {^1, ^2, F3 } ]} 

= {{[0,1]}, {[0,1]}, {0,1}, {l|i?i},{l|i?2},{l|i?3}} 

where the variables Yi take the place of ^Xi. Each successful valuation for this 
constraint is also a solution for the given 3-SAT problem and, vice versa, from 
each solution for the SAT problem it is immediate to assign values to the variables 
Yi and Ri in order to obtain a successful valuation for the constraint. One such 
solution is, for instance: 

Xi = 1 A Xa = 1 A X3 = 1 

(and Yi = 0,ra = 0,^3 = 0,i?i = {0,1}, i?a = {0,0}, i?3 = {0, 1}). 

2.3 The Language CLP{BAQ) 

The availability of a constraint solver which is able to prove satisfiability of 
multiset constraints in a selected structure makes it immediate to define a CLP 
language dealing with multiset constraints of the kind we have considered so 
far. This language is obtained as an instance of the general CLP scheme [16,17] 
by instantiating it over the language, the theory, and the structure of multisets 
presented in the previous sections. 

The resulting language — called CLP(BAG) — can be viewed as a natural ex- 
tension of traditional logic programming languages, e.g., Prolog. The syntactic 
form of a CLP(BAG) program, as well as its operational, algebraic, and logical 
semantics are those of the general CLP scheme [16,17] suitably instantiated on 
the specific constraint domain of multisets. This is very similar to what is de- 
scribed in greater detail in [12] for the case of sets. A working prototype of the 
CLP language which embeds both sets and multisets is available at [23] . 

A CLP(BAG) program P is a finite set of rules of the form 

A : — c, B. 

where A is an atomic formula (the head of the rule) , c is a T sag-constraint and 
i? is a (possibly empty) conjunction of atomic formulae. A goal is a rule with 
empty head. As usual, we assume that all the variables in a clause are universally 
quantified in front of the clause itself. 

Example 5 ( CLP(BAG) program). The following is a CLP(BAG) program that 
defines a predicate once(X,M) which is true if X occurs exactly once in the 
multiset M: 

once(X, {X]R}) : - 
X ^ R. 

A sample successful goal for this program is : — once(b, {a, b, a}). 
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Note that clause selection in a CLP(BAG) program is performed using mul- 
tiset unification, that is two multisets are compared disregarding the order 
of their elements. Thus, for instance, unifying {[X|R]} and {[a, b, a} will non- 
deterministically bind X to all possible elements of the second multiset and R 
to the corresponding multisets composed of the remaining elements (that is, 
X=aAR=:{[b,a]}, X = bAR = {[a, a}, X==aAR={[a,b]}). 

CLP{BAQ) defines a simple, yet very expressive, framework that allows us 
to program using multisets in a very flexible way. 

3 Defining P Systems in Csag 

A P system is a computational model based on transitions governed by multiset 
rewriting rules. P systems have been defined to mimic the evolution of biolog- 
ical systems (see, e.g., [21]). Their definition formalizes the intuitive idea that 
biological reactions can be seen as computations. In this section we show how 
P systems can be naturally defined using the first-order language Csag and the 
theory Bag. 



3.1 Membrane Structures 

The language MS oi membrane structures is defined in [21] to be the set of 
strings over the alphabet {“[”, “]”} generated by the infinite set of rules: 

— []eMS 

— for n > 1, if si, . . . , s„ G MS, then [si, . . . , s„] G MS. 

A membrane is characterized by a pair of open and closed parentheses [, ]. The 
degree of a membrane structure is the number of membranes in it. An equivalence 
relation ~ is defined to govern membrane equivalence and an ad-hoc notion of 
depth is given. The outer membrane is called the skin. 

Moving to Csag, this is equivalent to say that MS is the set T{X) of first- 
order ground terms over the signature E = {nil, {]• | •]]■}. Such set is generated 
by the two rules: 

— nil G MS 

— if s, t G MS, then {[s 1 1 } G MS. 

Membranes are therefore identified with multisets. ~ is exactly the equivalence 
relation =EBag induced on T{E) by the (A™) permutativity axiom (c.f. Sec- 
tion 2). 

To compute the degree of a term s it is sufficient to count the number of 
occurrences of the constant symbol nil in s. This assertion is justified by the 
following observations: 

— if nil occurs as element of a multiset (i.e., as first argument of a term of 
the form {[ - | - ]}), such as in {] si, . . . , s„, nil | r]} then it must be counted as 
a pair of open and immediately closed brackets. 
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— Otherwise, if nil occurs as the tail of a multiset (i.e., as the second argument 
of a term of the form {[ _ | _]}), such as in {[ si, . . . , s„ | nil ]} then it exactly 
identifies one non-empty multiset. We recall that the multiset {[ si, . . . , ]} 

is represented by the term {[si|{[s 2 | • • • {[s„ | nil]} •••]}}. 

The notion of depth is the well-established notion of rank of a (multi)set: 

, , . _ J 0 if s = nil 

ran (sj } 1 -|- max{ronfc(t) : t € s} otherwise 

A compact way to state that a multiset {[ Si, . . . , ]} is included in a multiset 

{[ti, . . . , } is that of imposing the constraint (i? is a logic variable): 

{[tl,...,t„]} = {[si,..., Sm I .R ]} • 



3.2 Super-cells 

A super-cell is a membrane structure s in which each membrane (i.e., each 
multiset) can contain elements coming from a (possibly denumerable) universe 
U of other objects (i.e., non-membrane objects) [21]. Each membrane is uniquely 
identified by a natural number (its label). 

An active super-cell is a super-cell which may contain more than one occur- 
rence of membranes with the same label. Active super-cells are introduced in 
[22] to model the notion of membrane division. 

To naturally represent labeling of membranes in Csag we will adopt the 
notion of kernel.^ We fix the signature S to be defined as 

r = {{].|.}}uNniiU[/ 

where Nnii = {nil = nili, nil 2 , nils, . . .} is the denumerable set of constant 
symbols (to be used as multiset kernels), and [/ is a set of constant symbols (the 
objects) disjoint from Nnii. 

An active super-cell is a ground A'-term such that for all subterms of the 
form {] Si , . . . , Sm I fc ]}) where m > 0 and k is not of the form {] _ | _ }, it holds 
that k G Nnii (in other words, symbols from U cannot be used as kernels). A 
super-cell is an active super-cell such that each nil^ occurs at most once. 

Figure 2 shows a super-cell of degree 5 and the corresponding representation 
as an Csag term. 

As for membrane structures, equivalence between super-cells is multiset equiv- 
alence The notion of degree, instead, must be refined: the degree of an 

active super-cell s is the number of occurrences of constant symbols of Nnii in it. 
Hereafter, we usually refer to super-cells where there is exactly one occurrence 
of each constant nili, . . . ,nilrfggree(s)- 

^ As explained in Section 2, if there is at least one constant symbol c € E, then it 
is possible to write a term t of the form {[si, . . . , s„ j c]|. t identifies a multiset with 
the n elements si, . . . , s„ (not necessarily all distinct) and colored (labeled) by the 
constant c. We also say that c is the kernel of t, briefly ker(t) = c. 
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{[ a, 

b, 

{[a,a, fe, {[a,a,nil5 |nil4]} |nil2]}, 

{[ 6 , 6 , & I nils ]} |nili]} 



Fig. 2. A super-cell of degree 5 and the corresponding term 



3.3 Transition P Systems 

We give here the notion of P system using the language Csag- A transition P 
system of degree n can be defined as a tuple 

Tn — {U •, /t, (Ai, Pi), ■ • ■ , {Pm Pn)j ^o) 

where C/ is a universe of objects, p is a super-cell of degree n, io is the output 
membrane, and, for all t = 1, . . . , n, is a finite set of evolution rules (to be 
described below) and pi is a partial order relation over Ri. 

A computation of T„ is a sequence of super-cells p = po,/ti, • ■ • , Pm where 
Pj+i is obtained from pj by applying one or more evolution rules. 

Observe that the n membranes occurring in p are uniquely identified by 
their integer label i. Referring to the multiset representation, this means that 
the multisets in p are distinguished each other by their kernels, nili, . . . ,nil„. 
Hereafter, we will denote these multisets by mi, ... , m„. 

An evolution rule is a pair of the form u v where u is a string of elements 
of U and v is either of the form v' or v'S where v' is a string over 

{V X {here, out}) U {V x {irii, . . . , in„|). 

and V is an alphabet. A rule in Ri applies to the multiset m^. Applying a rule 
causes some effects to occur either on m^ or on the multiset possibly containing 
mi according to the form of the right-hand side of the rule. More precisely, the 
semantics of evolution rules, expressed in terms of the multiset representation 
introduced so far, can be described as follows. Let us consider first rules of the 
form u -I- v', that is without S. Consider a rule in Ri with the following general 
form: 

ui...Uhi-^ {vi, here) • • • , here) (wi, out) • • • out)(zi, in,n) • • • (^fcad^x^g) 

The rule can be applied to the multiset mi if mi contains (in any order) all 
the objects Ui,. . . ,Uh- In the language of multiset constraints, this fact can be 
expressed by requiring that the constraint 

mi = {ui, . . . ,Uh \ M} A ker(M) = nil* 

holds, for some multiset M (assume ker is the function returning the kernel of a 
ground multiset). Applying this rule yields the following effects: 
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a. local effect: 

replace in /r by {[ui, . . . , \M\ (if fci = 0 replace rrii by M). 

b. exit effect: 

if rrii G rrij then replace nij in /r by {[ wi, . . . , Wk-^ \ iTij ]}; if rrii is the skin 
then simply remove the elements Wi, , Wk 2 from it. 

c. adjacency effect: 

for j = 1, . . . , fca, if rrii G rrixj or rrixj G rrii (i-e. the two membranes are adja- 
cent), then add Zj to rrix^ ■ If for some j = 1, ... ,ks the adjacency condition 
is not fulfilled, the application of the rule is not allowed. 

Consider now the scheme of a <5 rule {dissolving rule): 

■ui . . .M/i 1 -^ (ui, here) • • • , here) (ici, out) • • • (wfc^, out)(zi, in^^J • • • {zk^j'^^x^JS 

After having obtained the local, exit, and adjacency effects as in the rule of 
the first kind, the S causes the further effect of destroying mi and carrying out 
its elements (unless is the skin that cannot be destroyed). Thus, if mj = 
} (i.e. G mj) and mi = {[ ci, . . . , | nil^ } then replace mj by 

Id,..., ce\mb}. 

3.4 P Systems with Active Membranes 

In [22] new rules are allowed to occur in P systems. In particular we are interested 
here in the possibility of creating a copy of a membrane. Thus we consider the 
following simplified version of the division rule [22] 

Ui...Uk^ ((Vi . ..Vh)(wi . ..Wj)) 

The semantics of this rule, expressed in terms of the multiset representation 
of super-cells introduced in this section, can be described as follows. The rule 
can be applied — like other rules — to the multiset if contains u\, . . . ,Uk, 
that is if the constraint 



mi = \ui, . . . ,Uh\M\ ^ ker(M) = nil* 
holds, for some multiset M . Applying this rule yields the following effects: 

— remove elements u\, . . . ,Uk from mp, since mi = {[ui, . . . , rtfc | M}, M rep- 
resents what remains of the multiset mp, 

— make two copies of M (both having the same kernel nil^) and add all the 
elements v\,. . . ,Vh to the first copy and w\, . . . ,Wj to the second copy; that 
is, in terms of our multiset representation, replace mi in /i with the two new 
multisets 

lvi,...,Vh\M} lwi,...,Wj\M}. 

Fig. 3 shows an example of the effect of the application of the membrane 
division rule. Observe that this rule can be applied to more than one membrane, 
possibly simultaneously. 
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aa —> (b) (be) 

Fig. 3. Membrane division 



4 Implementing P Systems in CLP{3AQ) 

We show that a transition P system can be written in an easy way as a program 
in a CLP language which supports a decision procedure for multiset constraints. 
In particular, we show how to translate a P system into a CLP(BAG) program 
(see Section 2.3). 

The main predicate governing the application of the evolution rules is: 

p_interpreter(Membrane, Membrcuie) : — 
not rule(MembrcUie, _). 

p_interpreter(MembraneIn, MembraneOut) : — 
rule(MembraneIn, MembrcUielnt), 
p_interpreter (MembrcUieInt, MembraneOut) . 

Note that the multiset transformation process terminates as soon as no rule 
can be applied successfully to the input multiset (first argument of p_interpreter) . 
Each evolution rule of each set Ri of the given P system can be (automatically) 
translated into a single CLP{BAQ) clause, according to the multiset-based rep- 
resentation of membranes and of computations described in Section 3.3. As an 
example, consider the rule: 

Ml . . . 1 -^ (ui , here) • • • (wfcj , here) (wi , out) • • • (wfc^ , out) . 

Assume that mi is not the outer membrane (in that case the clause must be 
slightly changed): 

rule(MembrIn, MembrOut) : — 

choose(MembrIn, nili. Mi, Mj), 

Ml = {ui, . . . ,Uh |M(]}, 

replace(MembrIn,Mi, {[ vi, . . . , | M( ]}, Ml), 

replace(MembrIn, Mj, {[ Wi, . . . , | Ml ]}, MembrOut). 

Mj is the membrane such that Mi € Mj . Multiset unification in the second sub-goal 
is used to test applicability of this rule to Mi (find whether ui, . . . ,Uh belong to 
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Mi), vi, . . . , Vjjj are the here elements. The last sub-goal updates the membrane 
containing Mi using the out elements Wi , . . . , . 

The auxiliary predicates choose and replace can be simply programmed in 
CLP(BAG) as follows. 

choose(Membrane, Kernel, Mi, Mj) chooses a multiset Mi in Membrane with ker- 
nel Kernel and the multiset Mj that contains it. 

choose({[Mi I R]}, Kernel, Mi, {[Mi | R}) : — 
kernel(Mi, Kernel). 
choose({[M I R}, Kernel, Mi, Mj) : — 
choose(M, Kernel, Mi, Mj). 

The predicate rep\ace{A, B , C, D) finds one occurrence of i? in A and replaces 
it with C obtaining D. 

replace(A, A, C, C). 
replace({[B|R]},B,C,{[C|R]}). 
replace({[M|R]},B,C,{[M'|R}) : - 
M/B, 

replace(M, B, C, M'). 

The predicate kernel(M, K) stating that the kernel of the multiset M is K can be 
programmed by: 

kernel(nili, nili). 



kernel(niln, niln) . 
kernel({[ A I B]}-, K) : — 
kernel(B, K). 

To implement the division rule it is convenient to introduce also a predicate 
rep\ace2{A, B,C1,C2, D) whose structure is the same as that of replace but it 
replaces B in A with both Cl and C2. 

We give here an example running on the interpreter of CLP{BAQ) [23] of a 
P system computing the function lA. 

Example 6. Consider the P system such that U = {a, b}, 

^l = {{a, I nil2 [} I nili } 

n 

The set of rules Ri is empty, while the rules in i ?2 are: 

(1) abb I— > {{bb){ab)) 

(2) ab^ b 

(3) 66 1 -^ 6(6, out) (6, out) 

(4) 6i-^(6, out)6 

ordered as (1) > (2) > (3) > (4). (As common in P systems, we assume that 
u 1 -^- (ui, here) can be abbreviated as u i-^- ui ) . At the end of each computation. 
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occurrences of the object b are present in the outer membrane. All other 
membranes in it have disappeared. Basically, we have used the well-known fact 
that ^)- produce membranes containing 6", . . . ,b^. 

Then we move out the tokens, in accordance with the mathematical law. The 
following CLP(BAG) clauses can be used to simulate these rules: 

%%%{!) 

rule(MembrIn, MembrOut) : — 

choose(MembrIn, nil 2 , M^, Mj), 

Mi = {a,b,b|Mi}, 

replace2(MembrIn, Mi, {[b, b | Mi ]}, {[ a, b | Mi }, MembrOut). 

%%%(2) 

rule(MembrIn, MembrOut) : — 

choose(MembrIn, nil 2 . Mi, Mj), 

Mi = {a,b|M']}, 

replace(MembrIn, Mi, {[b | Mi ]}, MembrOut). 

%%%(3) 

rule(MembrIn, MembrOut) : — 

choose(MembrIn, nil 2 . Mi, Mj), 

Mi = {b,b|M']}, 

replace(MembrIn, Mi, {[b | Mi ]}, Membrint), 
replace(MembrIn, Mj, {[b, b | Membrint ]}, MembrOut). 

%%%(4) 

rule(MembrIn, MembrOut) : — 

choose(MembrIn, nil 2 . Mi, Mj), 

Mi = {b|Mi]}, 

Mj = {[Mi I Membrint}, S removes Mi from Mj 
repIace(MembrIn, Mj, {[b | Membrint }, MembrOut). 

The CLP{BAG) program that simulates the given P system, therefore, is 
composed by the clauses for predicates p_interpreter and rule along with 
those for the auxiliary predicates choose, replace, and replace2. A goal for 
this program can be: 

: — p_interpreter({[ {[ a, b, b, b | nil 2 } | nili }, MembrOut), 
from which we get the computed answer 

I b, b, b, b, b, b, b, b, b I nili } 

Note that rule ordering is obtained by exploiting the corresponding CLP {BAG ) 
clause ordering. This solution works correctly for the first successful computa- 
tion; however, if other alternative computations are attempted through back- 
tracking, clause ordering is no longer sufficient to guarantee rule ordering. To 
force the desired ordering is always preserved one could add a unique identifier 
to each rule and slightly modify the definition of predicate p_interpreter so as 
to apply rules according to a list of rule identifiers which explicitly states the 
rule ordering for each membrane. 
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5 Implementing Constraint Solvers by P Systems 

Given a first-order language C, an interpretation structure A, and the corre- 
sponding theory, we show how to implement the algorithm for deciding satisfi- 
ability in ^ of a /^-constraint C using a transition P system derived from C. 
Basically, the condition for this translation to be fully automatized is that the 
constraint solving algorithm is given via rewriting rules (a condition often ful- 
filled in constraint decision procedures) . In this section we focus our attention on 
the solution of equality constraints, that is on the unification problem. We show 
how to implement both standard and extended unification (namely, multiset 
unification) algorithms using P systems. 



5.1 Standard Unification 

We start from the classical first-order unification problem — c.f., e.g., [15,20,1]. 
Let 27 be a first-order signature and C, si = A • • • A s„ = be a constraint 
composed by conjunctions of equations (i.e., an equation system). The unification 
problem, in its decision version, consists in checking if there is a substitution 6 
(a function from the variables occurring in C to the set of first-order terms) such 
that for alH = 1, . . . , n, 9{si) and 9{ti) are syntactically equal. A survey of the 
problem with the main results can be found in [1] . 

Let T{C) be the transition P system 

(C/,/x, (i?i,pi),(i?2,P2),*o) 



where: 

— the universe U is the set of all ground equations, {s = t : s,t G T{E)} 

— = {[m= jnili]} 

~ m= = \ si=ti,...,Sn = t„|nil 2 } 

— to = 1 (i.e. the output membrane is the skin) 

— Ri and i ?2 are the sets of evolution rules for multisets with kernels nili and 

nil 2 , respectively, that implement the unification test. 

Evolution rules for R 2 are obtained from the non-deterministic unification 
algorithm Unify [15,20] and are shown in Figure 4. Actually, each rule is a meta- 
rule consisting of all its infinite possible instances. However, only a finite number 
of instances depending on C is needed, as explained below. 

Ri, instead, is assumed to be empty in this case. 

Classical results on Herbrand’s non-deterministic algorithm ensure termina- 
tion of the computation of the transition P system independently of the order- 
ing chosen when rules are applied. Moreover, each non-deterministic execution 
is equivalent (equivalence of m.g.u.’s modulo variants) [1]. Thus, no ordering 
is needed between rewriting rules to ensure termination, and all the possible 
non-deterministic computations of the P system can be considered equivalent. 

By analyzing the computations of the P system it is easy to prove that 
the dimension of the terms involved can be bounded by the size of the initial 
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(ri) 
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II 

><; 


j- e (remove tautologies) 


(t'2) 


t^x] 
t is not a variable 1 


I X = t (i.e., (X = t, here ) ) 


(^■ 3 ) 


X = t A = s 1 
X does not occur in t I 


|-i-^X = t t = s 


(r4) 


x = t y = X 1 

X does not occur in t I 


|. X = t Y = t 


(rs) 


X = t\ 

X and X occurs in t I 


(false, out) 


(ra) 


f{si,...,Sm) = g{ti,...,tn) ] 

f^g\ 


I (false, out) 


(rr) 


/(Sl, . . . , Sm) — /(il, • • • , tm) 


Si = • • • Sm = tm 



Fig. 4. Unification as evolution rules (-R 2 ) 



constraint C used to define T{C). More precisely, let size and height be defined 
as follows: 

{ 0 if t is a variable 

1 + Ya=i size{U) if t = f{ti, 

Yh=i size{si) + size{ti) ii t = si = ti ^ ^ Sm = tm 

h ■ ® if t is a variable 

11 + max"^^ height{ti) if t = f(ti, . . . ,tn) 

Proposition 1. Let s = t be an equation occurring at some point during the 
computation o/Unify(C). Then height(s) < size{C) and height{t) < size{C). 

Proof. Immediate, by classical results. □ 

Thus, the number of rules in i ?2 can be chosen to be finite. Let S be the 
set of terms built using the function, constant, and variable symbols occurring 
in C of height bounded by size{C). R 2 is chosen as the set of instances of the 
meta-rules of Fig. 4 over the set S. 

Proposition 2. Let p, = piQ,...,pLm he a terminating computation of the P 
system T{C). C does not admit solutions if and only z/ false G m\ G ptm- 

Proof. The evolution of m 2 mimics the execution of standard unification algo- 
rithm Unify (that is terminating and correct). A successful termination does not 
produce false. Otherwise, false is stored in the membrane m± and never re- 
moved. □ 
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Observe that the P system obtained in this way, although generated by an 
instance C, is able to deal with a wide family of equation systems as inputs. 
Precisely, all input systems with variables, function and constant symbols from 
those of C, and with initial size bounded by that of C. 

5.2 Multiset Unification 

When one or more function symbols from S have some semantic properties, the 
unification problem becomes more complex. For instance, in the case of multi- 
set unification (as it happens in Bag) we use a binary function symbol whose 
semantics is governed by axiom (E™); in this case the unification problem turns 
out to be NP-complete [9]. We show here how to map the multiset unification 
algorithm of [9] to a P system with active membranes (using membrane divi- 
sion). This allows one, for instance, to obtain an alternative (polynomial-time) 
implementation of SAT using P systems, since SAT can be reduced to multiset 
unification (cf. Example 4). 

Let E = {[nil, {[• | •]},... } be the signature of Lsag- The transition system 
T{C) is the same as in Section 5.1, save for the rewriting rules Ri and i? 2 - They 
are chosen in order to simulate the multiset unification algorithm of [9] and are 
reported in Fig. 5 and Fig. 6. Moreover, some ordering on the rules of R 2 should 
be imposed to ensure termination in all possible cases as described in [9] . 



in) 


'X 


• £ (remove tautologies) 


(r2) 


t^x' 

t is not a variable 


i A = t (i.e., (A = t, here) ) 


(ra) 


A = t A = s^ 
X does not occur in t 


ii-^X = tt — s 


(ri) 


x = t y = A ^ 

X does not occur in t 


> X ^t Y = t 


(rs) 


X = 

X ^ t and X occurs in t 


5 


(re) 




5 


(r?) 


/(Sl, . . . , Sm) = • • • 1 tm) 


r" 1 — > Si = tl • • • Sm = tm 


(rs) 


{[ti,...,t^| A} = {[si,...,s„|iV]} 


’■ 1 {[ , . . . , tm ]} = {[ Si , . . . , Sri ]} 


(r9) 




1 i{t = t' s = s') 

( ^ (s = {[t'|A^Hi|^]} = s')) 



Fig. 5. Multiset Unification as evolution rules {R 2 ) 
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(ri) 



s = t I 1 -^ £ (remove equations) 



Fig. 6. Evolution rules for the skin {Ri) 



Consider the rules for membranes with label 2 (i.e., multisets with kernel 
nil 2 ). Rules ri-r 4 are the same as for the standard unification case. Rule ry is 
now restricted to non-multiset terms. Rules rg and rg treat the case of equalities 
between two multisets. These two rules introduce a slight notational ambiguity. 
As a matter of fact, the multisets occurring in those rules are not membranes 
but multiset terms in the first-order language in which the equations are written. 
This kind of confusion should be avoided by using different function symbols for 
building multiset terms in equations (e.g., [[ ] and | - | •]). Rule rg deals with 
the special case of two non-ground multiset terms with the same tail variable. 
Rule rg, instead, deals with the general case. It requires the ability to express 
a (don’t know) non-deterministic choice between two possible computations, 
corresponding to the two alternatives in the right-hand part of axiom It 

is implemented by membrane division. 

Rules rg and rg dealing with failures cause now the deletion of the mem- 
brane with label 2 in which the condition is fulfilled, putting all its remaining 
elements (if any) in the outer membrane. Therefore, equalities can be present 
in membrane 1 as effect of deletion of membranes with label 2. Rule ri in Ri 
removes all of them. At the end of the execution, membrane 1 is either empty or 
it contains some membranes with label 2, each one corresponding to a successful 
non-deterministic branch of the computation. 

Proposition 3. Let {j, = /io,...,/im be o, terminating computation of the P 
system T{C). C is unsatisfiable if and only if mi is empty. 

Proof. Immediate from the fact that we mimic the algorithm in [9]. □ 

As an example consider the equality constraint C that models the instance of 
3-SAT of Example 4. The P system T(C) which tests the satisfiability of this con- 
straint can be seen as (yet another) implementation of an algorithm to solve the 
3-SAT problem using P systems. Computations of T(C) run in non-deterministic 
polynomial time. If membrane division is implemented in an effective way, this 
yields a polynomial time method for solving 3-SAT (see also [22]). 

The constraint solver of CLP(BAG) (see [10]) deals not only with equality 
constraints, but also with disequality, membership, and not-membership con- 
straints. The same technique used for implementing unification as evolution rules 
can be applied almost unaltered to implement also the rewriting procedures for 
the other kinds of constraints. Thus the whole constraint solver of CLP{BAQ) 
can be implemented as & P system. 

Therefore, any problem which can be easily expressed as a multiset constraint 
(containing equalities, disequalities, memberships, and negation of memberships) 
can be automatically implemented using a P system. 
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6 Conclusions 

We have described how to use the language L Bag to define P systems in a fairly 
natural way. This gives a precise formulation of P systems inside a first-order 
theory. Moreover, we have shown that the CLP language CLP{BAQ) is particu- 
larly well-suited to simulate P systems, thanks to its capabilities of manipulating 
multisets in a very general and flexible way. Then, showing that the constraint 
solvers of CLP{BAQ) can be implemented by P systems (endowed with the 
membrane division rule), we have suggested an alternative way to encode, in a 
rather natural way, NP problems which are easily described using Csag con- 
straints into P systems. 

As a future work it could be interesting to analyze if using a Prolog system 
that supports some form of parallelism (e.g., [13]) as the execution environment 
where to embed our CLP language would provide a way to enhance also our 
simulation of P systems in the direction of a real parallel execution. 
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Abstract. A macroset is a (finite or infinite) set of multisets over a hnite 
alphabet. We introduce a Chomsky-like hierarchy of multiset rewriting 
devices which, therefore, generate macrosets. Some results are proved 
about the power of these devices and some open problems are formulated. 
We also present an algebraic characterization of some of the macroset 
families as least fixed point solutions of algebraic systems of equations. 



1 Introduction 

In the last years, the idea of multiset processing has appeared more and more 
frequently in various domains: nondeterministic programming [1], the chemical 
abstract machine [6], DNA computing [28], membrane computing (P systems) 
[25], [26] (see a current bibliography of the area in [27]). Multisets also appear 
in logic [4], linguistics [7], [14], artificial life [30], etc. The thesis [32] is entirely 
devoted to mathematically formalizing multisets. Several papers have considered 
fuzzy variants of multisets ([24], [21], [35]), while [12] and [5] deal with pomsets 
(partially ordered multisets). 

A multiset over a given set of objects corresponds to a string over the alphabet 
naming those objects. To a language it corresponds a sets of multisets, called 
here a macroset. 

Observing that P system theory is devoted to a generative approach to mul- 
tiset processing in a distributed way, in [27] one formulates the problem of pro- 
ducing a non-distributed theory of multiset generating, like Chomsky grammar 
theory. We systematically address here this question, by considering a natural 
translation of notions related to sequential rewriting of strings to rewriting of 
multisets. A Chomsky-like hierarchy of multiset grammars is obtained, but with 
some relationships between classes of macrosets which are different from those 
among the corresponding families of languages. 

* Research supported by the Direccio General de Recerca, Generalitat de Gatalunya 
(PIV). 
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In this framework, an important question could be solved: is erasing impor- 
tant in the case of non-context-free macroset grammars? This corresponds to the 
problem whether or not the Parikh sets of string context-free matrix languages 
generated with and without using erasing rules are the same, which, in turn, is 
related to he problem still open in the regulated rewriting area whether or not 
erasing rules increase the power of string matrix grammars (without appearance 
checking) . 

It should be noted that in [30], [31] one already considers an abstract rewrit- 
ing multiset system (ARMS, for short), which corresponds to our arbitrary 
multiset grammars, without introducing particular (Chomsky-like) variants; the 
ARMS are mainly used as a formal framework for simulating the development 
of populations in artificial life-like questions. Moreover, languages of multisets 
(or bags), also called commutative languages, and language families (trios, semi 
AFL’s, AFL’s) of commutative languages, have been considered in [8], [18], [19], 
[20], [16]. None of these papers proposes a Chomsky-like hierarchy of multiset 
processing grammars. 

2 Multiset Grammars 

All formal language theory prerequisites we use can be found in [29]. 

Let P be a finite alphabet; its elements are called both symbols and objects. 
The set of strings over V is denoted by V*; the empty string is denoted by A, the 
length of a: G V* is denoted by |x|, and the number of occurrences of a symbol 
a G P in a string a: G P* is denoted by |a;|a. 

A multiset over an alphabet P is a mapping fj, : V — > N. We denote by 

the set of all multisets over P. We can represent the multiset /i as a vector, 
= (m(oi)) ■ • • ) M(on))) for P = {ai,...,a„}. Note that the ordering of 
the objects in P is relevant. The multiset /i can also be represented by any 
permutation of the string Clearly, for each permutation 

w' of we have 'I/v{w') = t^{V), where Wy is the Parikh mapping associated 
with P. Sometimes we shall also write for a multiset fx. 

Conversely, with each string x G V* we can associate the multiset Hx ■ P — 
N defined by /Xa,(a) = |a;|o, for each a G P. 

For a multiset /i we denote by |/i| = weight or norm; 

for U CV, we also denote \fj,\u = 

A (finite or infinite) set of multisets over an alphabet P is called a macroset. 
Thus, V* is the universal macroset over P (it corresponds to the “universal 
language” V* over P). A macroset M C P#, for P = {ai,...,a„}, can be 
naturally represented by the set of vectors {{tx{V)) \ ^ G M}. 

For two multisets fXi,iX 2 over the same P we define the inclusion fXi C H 2 , 
the sum ni + 1 x 2 , the union /ii U /i 2 , the intersection yti n 1 x 2 , and, for the case 
Ml C M 2 only, the difference ixi — 1 x 2 in the following ways: 

Ml ^ M 2 iff Mi(o) ^ M 2 (a)) for all a G P; 

(Mi + M 2 )(a) = Mi(a) + M 2 (a)> for each a G P; 




Toward a Formal Macroset Theory 



125 



(/Xi U /i2)(a) = niax(/ii(a), /i2(a)), for each a G V; 

(/ii n /i2)(o) = min(/ii(a), /X2(a)), for each a G V; 

{fii — 1x2) (a) = 1x1 (a) — 1x2(0), for each a G V. 

A usual set C/ C y is a multiset /X[/ with fiu{a) = 1 iff o G C/. The empty 
multiset is denoted by 0 and it corresponds to the empty string A. 

The correspondence among strings and multisets (hence among languages 
and macrosets) suggests considering the following macrosets generating devices. 

A multiset grammar is a construct G = {N, T, A, P), where iV, T are disjoint 
alphabets, the nonterminal and the terminal one, respectively (we denote their 
union hy V), A G is a finite macroset over V (its elements are called axioms, 
and P is a finite set of multiset rewriting rules (in short, rules) of the form 
Ml ^ /X2) where /Xi;M2 are multisets over V and I/XiIat > 1 - (Taking advantage 
of the string representation of multisets, we will usually write strings instead of 
multisets in A and in rules.) 

For two multisets a\, 02 over V, we write a\ for some r : /xi ^ /X2 G 

P if /xi C a\ and «2 = (oi — /xi) + M2- If r is understood, then we write 
instead of =^^.. We denote by the refiexive and transitive closure of the 
relation =^. The macroset generated by G is defined by 

M{G) = {(3 G I a ( 3 , for some a G A}. 

A natural Chomsky-like classification of such grammars is the following one: 

1 . Grammars G as above are said to be arbitrary. 

2 . If |/ii| < |/i2| for all rules yL\ /X2 in P, then G is said to be monotone. 

3 . If |/ii| = I for all rules yti /X2 in P, then G is said to be context-free. 

4 . If 1 ^ 1 1 = I and |m2|at < 1 for all rules /xi — > /X2 in P, then G is said to be 
linear. 

5 . If 1 ^ 1 1 = 1 , 1^2! < 2 , and |M2|Af < 1 for all rules /xi ^ ^2 in P, then G is said 
to be regular. 

We also consider the following subclass of linear grammars, which corre- 
sponds to no Chomsky class: 

6 . If G is a linear multiset grammar such that for each rule /Xi ^ /X2 in P such 
that M2(oi) > 0,M2(oj) > 0 for some 1 < x < j < n, imply M2(ofc) > 0 for 
alH < fc < j, then G is said to be local. (We always increase the number of 
copies of objects which are adjacent in the ordering of V.) 

We denote by mARB, mMON, mCF, mLIN, mREG, mLOC the fami- 
lies of macrosets generated by arbitrary, monotone, context-free, linear, regular, 
and local multiset grammars, respectively. By FIN, REG, LIN, CF, CS, RE 
we denote the families of finite, regular, linear, context-free, context-sensitive, 
and recursively enumerable languages, respectively. For a family F of languages 
we denote by PsF the family of Parikh sets of vectors associated with languages 
in F. The family of all semilinear languages is denoted by SLin. 

We also consider here matrix grammars, with and without appearance check- 
ing, both for the string and the multiset case. The definition for multisets is a 
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direct extension of the definition for strings. Always we consider only matrix 
grammars with context-free rules. We denote by MAT the family of languages 
generated by matrix grammars without A rules and without appearance check- 
ing; if erasing rules are used, then the superscript A is added, if appearance 
checking features are used, then the subscript ac is added. For the multiset 
case we add the letter m in front of these notations, thus obtaining the families 
mMAT, mMAT^, mMATac, and mMAT^^, respectively. 

3 Generative Power 

We start by proving a normal form result which says that instead of a finite set 
of axioms, we may consider one axiom only, in the form of a single nonterminal 
symbol (like in Chomsky grammars). 

Lemma 1. For each multiset grammar G = (N,T, A, P), there is an equivalent 
multiset grammar G' = {N' ,T,A' ,P'), where A' contains only one multiset S 
over N' U T, such that [S'! = 1. Moreover, G' is of the same type as G. 

Proof. For arbitrary, monotone, and context-free grammars the assertion is ob- 
vious: take one more nonterminal symbol, S, add it to N and add all rules 
S —>■ w, for w € A, to P. We obtain an equivalent grammar, which is monotone 
(context-free) if G was monotone (context-free). 

For the linear case we proceed as follows. Assume that A = {wi, . . . , w„}, 
with Wi G (JV U T)^. Let N{wi),T{wi) be the multisets of nonterminals and of 
terminals, respectively, which appear in Wi,l < i < n, and let t = max{|A(r(;j)| | 
1 < i < n}. Then, consider 

N' = {[w] G A* I 1 < |w| < t} U {S}, 

A' = {^}, 

P' = {S ^ [A(wi)]T(wj) I 1 < i < n} 

U {[w] ^ [w']x I [ru], [ic'] G N' , w' = {w — X) + Y, for a rule 
X ^Yx G P,X,Y G N,x GT*} 

U {[X] ^ X \ tor X ^ X G P, X G N,x G T*}. 

The equality M{G) = M{G') is obvious and also obvious is the fact that G' is 
linear, regular, local if G is linear, regular, local, respectively. □ 

Because most of the relationships between the above mentioned families of 
macrosets are easy to be established, we directly give a synthesis result (also 
including a relation which will be proved later): 

Theorem 1. (Macroset Hierarchy Theorem) The relations in the diagram from 
Figure 1 hold; the arrows denote inclusions of the lower families into the upper 
families; all these inclusions are proper, with the exception of the arrow marked 
with a question mark, for which the properness is open. 

Proof. In view of Lemma 1, we can always assume that the set of axioms contains 
only one axiom, consisting of a single nonterminal. 
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The equalities mREG = PsREG, mLIN = PsLIN and mGF = PsGF are 
obvious; because PsREG = PsLIN = PsGF = SLin, we get the equality of all 
these families (see also Section 4). 

The equality PsREG = SLin follows from the definition of semilinear sets 
as finite unions of linear sets : M = lj”^i Mi with Mi = {ci + J2Y=i I G 

N} which can be generated by a regular multiset grammar S Si, Si Ai + Ci, 

Ai Ai + Xij, Ai Xij, and the fact that PsGF C SLin ([11], Theorem 5.2.1). 

It is obvious that mFIN = PsFIN, and that mLOG contains infinite ma- 
crosets. 

Consider the macroset M = {{n,l,n) | n > 1}, for V = {01,02,03}. It is 
easy to see that we cannot generate this macroset by a local multiset grammar 
(all vectors (n, 1, n), n > 1, belong to M, hence we have to increase the number 
of Oi and 03 occurrences in a synchronized way, but we cannot do this in a local 
way), but we have M G mLIN. This shows that mLOG C mLIN is a proper 
inclusion. 

Given a monotone multiset grammar G = (N,T, A, P), we can construct a 
matrix multiset grammar in the following way. For each X G N UT consider the 
new symbols X' , X" . For each rule X-^X-i . . . X}~ Y 1 Y 2 . . .Yj. in P, with k < r 
(we use the string representation of rules), consider the matrix 



(X{ - El 



1 5 • 






v" v' v'v' v' v" v' v" ^ v' \ 

— fc fc+1 • • • 1 1 ’ ' • • 5 1 ^k — l)' 



(The use of double primed symbols prevents the rewriting of an object introduced 
by a previous rule of the same matrix; this is allowed in matrix grammars but 
not in monotone grammars.) The matrix grammar with these matrices, with all 
primed symbols as nonterminals, and with the axioms obtained from the axioms 
of A by priming their objects generate the same macroset as G. The technical 
details are left to the reader. Thus, mMON C mMAT. 

Now, the equality mMAT = PsMAT is obvious. (A normal form as that 
in Lemma 1 we have started with also holds for matrix grammars. See also 
[8].) From a string matrix grammar G = (N, T, S, R) in the binary normal form 
we can immediately pass to a multiset matrix grammar G' (for (A —>■ Y, A —>■ 
x) G R consider the multiset rewriting rule XA Yx, etc) such that M{G') = 
'Pt{L{G)) . Consequently, mMAT C mMON, and therefore mMON=mMAT 
= PsMAT. 

The inclusion mCF C mMON is proper because MAT contains non- 
semilinear languages. The simplest example, from Petri net theory, is ac 
add, bd ^ be, a ^ b,b ^ ae, a ^ e with axiom ac, generating the non-semilinear 
multiset language L = {fj, G {c, e}^ | ^(c) < 

The proper inclusion mMON C mARB is a consequence from the facts 
mMON C PSPAGE, that the uniform word problem is EXPSPAGE-hard 
for mARB [15], and that PSPAGE C EXPSPAGE. 

The equality mMATac = PsMATac is obvious. PsMAT C PsMATac is a 
proper inclusion, because the one-letter languages in MAT^ are regular, while 
MATac contains the language |a^ I n > 1}. Therefore, both the inclusions 
mMON C mMATac and mMAT^ C mMAT^^, are proper. 
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The equalities mARB = mMAT'*' = PsMAT^ follow in the same way as 

mMON = mMAT = PsMAT. 

The equality mMAT^^. = PsMAT^^ is obvious, PsRE = PsMAT^^. fol- 
lows from RE = MAT^^,. Because there are one-letter languages in RE which 
are not in CS, the inclusion PsCS C mMAT^^, is proper. 

The inclusion PsMAT ac C PsCS follows from MATac C CS. 

mETOL = PsETOL is obvious since derivations are context-independent. 
The proper inclusion mCF C mETOL follows from the fact that the non- 
semilinear language L = {a^ | n > 1} is in mETOL. mETOL C mMATac is 
a consequence of ETOL C MATac- □ 

Observe the unexpected relations mMON = mMAT, mARB = mMAT'^, 
mMON C mMAT^ CmMATac, mARB C mMAT^^., mMON C PsCS, 
which correspond to relations from the language case which are different, un- 
known, or even opposite (this last case appears for mMON C mMATac versus 
MATac c CS). 

Quite interesting is also the fact that mARB C PsRE is a strict inclusion. 
This means that arbitrary multiset rewriting rules are not sufficient in order 
to get a characterization of the power of Turing machines as multiset process- 
ing devices. Regulated rewriting of multisets (following the model of regulated 
rewriting in formal language theory, [9]) is thus necessary and, at least for matrix 
grammars with appearance checking, we get a characterization of PsRE. 



mMAT(;e=PsRE= PsMAT);^ 



mARB=mMAT^=PsMAT^ PsMON=PsCS 

I t ? 






m M AT ac = P s M AT ac 



mMON=mMAT=mARBib=PsMAT mETOL=PsETOL 




mREG=mLIN=mCF=PsREG=PsLIN=PsCF=SLin 



mLOC 



mFIN=PsFIN 



Fig. 1. The hierarchy of macroset families 
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Note also that PsRE is of the power of Turing machines since any word u 
can be encoded as a multiset even as G N such that any L G RE can 

be represented by a multiset language /r(T) G PsRE. 

Many such characterizations can be found in the distributed framework of P 
systems; the previous observation that mARB C PsRE can also be seen as a 
further motivation of membrane computing. 

The corresponding hierarchy of families of word languages is shown in Fig- 
ure 2 [10] where also the family ETOL has been added. 



MAT^e=RE 




MAT^ MON=CS 





CF 



LIN 



REG 

Fig. 2. The hierarchy of word language families 



In the diagram in Figure 1 there appears a family which was not defined yet, 
mARB lb. This is the family of macrosets which can be generated by arbitrary 
multiset grammars with a linearly bounded workspace. 

For G = {N,T, A, P), let ct : a /3 be a derivation with a € A. The 
workspace of cr, denoted by IPS' (cr), is the maximal weight of multisets appearing 
in a. Then, for (d G we put 

WScild) = inf{lP5'(cr) | cr : a /3, a G A}. 

We say that G has a linearly bounded workspace if there is a constant k such 
that for each non-empty fd G M{G) we have WS{(d) < k\(d\. 

The family of all macrosets generated by grammars with a linearly bounded 
workspace is denoted by mARBib. 
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The following result corresponds to the workspace theorem in formal language 
theory; the proof for multisets is simpler than in the case of languages. 

Theorem 2. (Workspace Theorem) mMON = mARBib- 

Proof. We have only to prove the inclusion mARBib ^ mMON. 

Consider an arbitrary multiset grammar G = (N, T, A, P) and a constant 
k such that WSa{f3) < k\f3\ for all non-empty (3 G M{G). We construct the 
monotone multiset grammar G' = {N' ,T, S, P') with 

N' = {{u) \ uG{NUTU {&})#, |u| = k}U {S'}, 

where S and b are new symbols, and the following rules (we denote V = N U 
TU{b}): 

1. S ^ {Vl) ...{Vq){Vq+lb*)S, 

S (vi) . . . {Vq){Vq+lb*} 

for vi . . . VqVq+i G A, I = fc, 1 < i < g, g > 0, and \vq+ib* | = A:, t > 0, 

2. S ^ (6'=)S, 

3. {u){v) {u'){v') 

for all u, V, u', v' G |m| = |u| = |m'| = |u'| = k,u + v = u' + v' , 

4. (mi) . . . (Ur) (vi) . . . (Vr), 

for each {m), {vi) G N' , I < i < r,r > 1, such that mi -I- U 2 -I- . . . -I- =k 
v\ + V 2 + ■ ■ ■ + Vr hy a, rule u ^ v G P, 

5. (mi) . . . (Ur) (ui) . . . (Ur)(Ur+l) . . . (vt) , 

for {ui),{vj) G N',1 < i < r,l < j <t,r> l,t > r, such that U 1 +U 2 + ■ ■ ■ + 
Ur =k V 1 + V 2 + ■ ■ . + Vr + Vr +1 + ■ ■ ■ + Vf hy & rule u ^ V G P and \vi\h = 0 
for all 1 < i t — 1, Vt G |ut| = k, 

6. (u) u, for all u G {T U {6})^ with lulr > 1. 

The nonterminals of G' are multisets of weight k, maybe with multiple oc- 
currences of the object b. By rules of type 1 we introduce the axioms of G, 
splitted among nonterminals of G'; further nonterminals of the form (&^) can be 
introduced, if needed in the derivation, by rules of type 2. By rules of type 3, 
the contents of multisets in the nonterminal symbols of G' can be freely inter- 
changed. By rules of types 4 and 5 we can simulate the rules from P, by making 
use of the “free space” ; made available by the occurrences of b in the nontermi- 
nals of G' . When no nonterminal with respect to G is present in a nonterminal 
(u), then by rules of type 6 we introduce the terminal objects of u, providing 
that at least one exist; the occurrences of b are ignored. Thus, one can easily see 
that we get M{G) = M{G'). □ 

4 An Algebraic Characterization 

Another way to characterize the families mREG, mLIN, mCF, PsREG, 
PsLIN, and PsCF is to define macrosets as least fixed points (solutions) of 
corresponding systems of equations on the w-complete semiring with vector ad- 
dition as underlying operation (for the theory of semirings, see [13], [17]). This 
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can be done in a quite general manner for other operations, giving rational 
(corresponding to regular), linear, and algebraic (corresponding to context-free) 
languages/macrosets. If the underlying operation is commutative all these fam- 
ilies of languages / macrosets coincide and this is the case for vector addition on 

N". 

Let At be a monoid with binary operation o and unit element 1, or with a 
binary operation o : At x At ^ V{Ai) and unit element 1, i.e., loa = aol = 
{«}• 

Extend o to an associative operation o : V{M) xV{M) V{M), distributive 
with union U (Aol^BUC) = (Ao B)U{AoC) and (AUB)oC = (AoB)U{BoC)), 
with unit element {1} ({1} o A = A o {1} = A), and zero element 0 (0 o A = 
Ao0 = 0). 

Then S = (7^(AJ), U, o, 0, {!}) is an w-complete semiring, i.e., if Ai C 
for 0 < i then B o Ui >0 = Ui>o(B ° A,) and (Ui>o A) o B = Ui>o(^* ° B). 

Define also = {!}, A^^^> = A, = A o A^^^\ = Ufc>o 

Let A = {Xi, . . . , X„} be a set of variables such that A n Al = 0. 

A monomial over S with variables in A is a finite string of the form: A\ o 
A 2 o . . . o Afe, where A^ G A or A^ C M,\Ai\ < 00 , i = 1,. . . ,k. Without 
loss of generality, Ai = {ai} with ai & M suffices. The (or {oy}) will be 
called constants. A polynomial p(X) over 5 is a finite union of monomials where 
A= (Ai,...,A„). 

A system of equations over 5 is a finite set of equations 
£ := [Xi = Pi(A) I i = 1, . . . , n}, 

where Pi{Xf) are polynomials. This will also be denoted by A = p{X_). 

The solution of £ is a n-tuple L = (Li, . . . ,Ln) of sets over At, with Li = 
Pi{Li, . . . , Ln) and the n-tuple is minimal with this property, i.e., if L' = 
(L(,...,L(j) is another n-tuple satisfying £, then L < L/_ (where the order 
is defined componentwise with respect to inclusion, i.e., A = (Ai,---,A„) < 
(i?i, • • • , Bn) = B_ii and only if for all 1 < t < n we have Ai C Bi). 

From the theory of semirings it follows that any system of equations over S 
has a unique solution, and this is the least fixed point starting with 

^( 0 ) ^ = (0,---,0) = 0, and =p(a(*)). 

Then the following fact holds: for 0 < t. 

This is seen by induction and the property of the polynomial with respect to 
inclusion, as 0 < A^^^ and x(*+^^ = p(X^*'^) < p(a(*+^)) = X^*+^\ 

A general system of equations is called algebraic; it is linear if all monomials 
are of the form A o A o B or A, and rational if they are of the form A o A or A, 
with ACM and B C M. Corresponding families of macrosets (solutions of such 
systems of equations) are denoted by ALG(o), LIN(o), and RAT(o). In the case 
o is commutative then all families are identical: ALG(o) = LIN(o) = RAT(o). 

Interpreting the variables of the system of equations as non-terminals, then 
also rational (regular), linear and algebraic (context-free) grammars can be de- 
fined. They generate the same languages as those defined by least fixed points. 
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Since the operation of vector addition is commutative, it follows that rational, 
linear and algebraic languages coincide. In this way, we have again obtained the 
corresponding equalities from Theorem 1. 

5 Conclusion 

Of course, many other problems remain to be investigated; the whole program 
of formal language theory can/must be repeated for macrosets. For instance, we 
can define pure, distributed (like in grammar system area), or parallel (like in 
Lindenmayer and in P system area) multiset grammars; also other regulating 
mechanisms than the matrix one can be considered. We leave to the interested 
reader the task to make the further step toward Formal Macroset Theory. 
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Abstract. We investigate closure properties of multiset languages, de- 
fined by multiset grammars. To this aim, this abstract families of multi- 
set languages are considered, as well as several normal forms for multiset 
grammars. Furthermore, a new definition of deterministic finite multiset 
automata is proposed. 



1 Introduction 

Recently [4] commutative grammars, i.e., multiset grammars, have been reinves- 
tigated for the description of new computational models inspired from biology 
and biochemistry [5]. 

In this paper we investigate closure properties of such multiset languages 
under multiset operations corresponding to those for word languages. 

In Section 3 we present various normal forms for multiset grammars following 
the lines for word grammars, and a characterization of regular multiset languages 
by an expression of iteration height 1. This differs entirely from the corresponding 
star height of regular word languages. 

In Section 4 we also introduce a variant of a deterministic finite multiset 
automaton [1] which is equivalent to the nondeterministic version. 

To show some closure properties, especially under inverse homomorphism 
and iteration, abstract families of multiset languages are considered in Section 

5. 

In Section 6 we present all results obtained so far. Still open are closure under 
set difference and complement for monotone and arbitrary multiset languages, 
as well as under 0-free substitution and homomorphism for monotone multiset 
languages. Interesting is also the closure under other multiset operations. 

2 Multisets and Semilinear Sets 

Definition 2.1. (Multisets) Let V = {ai,---,a„} be an alphabet. A multiset 
over V will either be denoted by a; = {/j,x((1i) • oi, • ’ ’ > l^x(an) • On)) where jJLx(<ii) 
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is the multiplicity of a^, or as a vector x = • • • , /Xa;(a„)) G W”. Another 

notation sometimes used is with a° = A (the empty word ). Let 

the set of multisets over V be denoted by V® . The neutral element is denoted 

by 0. 

If a: is a multiset define <j{x) = as its norm or length or weight. 

Write 5 G x if gLxi,C) > 0- 

For two multisets x = (nxicii) • Oi | > 0) and y = {y,y{ai) • | i > 0) define 

X C y iff Vt > 0 : fixi^i) < h'yicii)- Analogously, define z = x + y by yz(ai) = 
tix(ai) + giy{ai) for i > 0, and z = x - y by Hzi,ai) = max{0, ^3,(0,) - gLy{ai)}. 

Define also z = x n y and z = x U y by yz{cii) = rnin{yx{(ii), /j,y{ai)} and 
Hzi,ai) = max{yxi.ai),tJLy{ai)}, respectively. 

The next definitions follow [2]. 

Definition 2 . 2 . {Semilinear sets) A linear set M over FV" is defined by a 
multiset c {constant) and finite set P C FV" = {pi, ■ ■ ■ ,Pm} of multisets {periods) 
as follows: 

M = {c+ % ■ Pj I % G ^}- Note that M = {c} + P®. 

A semilinear set M over £V" is a finite union of linear sets, with sets of 
periods Pi = fei, • • • 

M = Mi with Mi = {a + kij ■ pij \ hj G M}. 

Therefore, M = Ui=i({ci} + P®)- 

Denote by sLin the family of all semilinear sets. 

3 Multiset Grammars 

Definition 3 . 1 . A multiset grammar is a quadruple G = {N,T, A, P), where 
N, T are disjoint alphabets, the nonterminal and the terminal one, respectively, 
with V = N UT. A C V® is a finite set over V (its elements are called axioms), 
and P is a finite set of multiset rewriting rules (in short, productions) of the form 
Pi P2, where pi,p2 are multisets over V and I^iIat > 1. 

For two multisets 01,02 over V, write ai^Ao!2 for some r : yi ^ /Z2 G P 
if Pi C oi and «2 = (oi — pi) + P2- r is understood, then write — > instead 
of . Denote by the reflexive and transitive closure of the relation — >. 
The multiset language generated by G is defined by 

M{G) = {/3 G T® I (3, for some a G A}. 

A natural Chomsky-like classification of such grammars is the following one: 

1. Grammars G as above are said to be arbitrary. 

2. If |/ii| < |y.2| for all rules pi p2 in P, then G is said to be monotone. 

3. If |/ii| = I for all productions pi p2 in P, then G is said to be context-free. 

4. If \pi\ = I and \p2\N < 1 for all productions p\ p2 in P, then G is said 
to be linear. 

5. If \pi\ = I, \p2\ < 2, and \p2\N < 1 for all productions p\ p2 in P, then 
G is said to be regular. 
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Denote by mARB, mMON, mCF, mLIN, mREG the families of macro- 
sets generated by arbitrary, monotone, context-free, linear, and regular multiset 
grammars, respectively. 

Lemma 3.1. To each multiset grammar G there exists an equivalent one G' with 
A' = {S'} and S G N . If 0 G M{G) for a monotone grammar then S ^ 0 G P' 
and S does not appear in any right hand side p ,2 of P' . 

Proof. Let G = (N,T,A,P) and A = {ai, ■ ■ ■ , am}, and S ^ N. Define 
G = (A U (Sj, T, (Sj, P') by P' = P U (S ^ Oj I 1 < j < m|. If 0 G M(G) for 
monotone G, let also S ^ 0 G P' . Then M(G') = M{G). □ 

Lemma 3.2. {Separated Form) To each context-free, monotone or arbitrary mul- 
tiset grammar there exists an equivalent one in separated form, i.e., all produc- 
tions are of the forms ni /i 2 with pLi G A® and either yi 2 G A® or qi 2 G T or 
pL2 = 0 . 

Proof. Let G = (A, P, {S},P). Let G be already in form of Lemma 3.1. 
Define G' = (A U P, T, |S}, P'), and the homomorphism h by h{a) = d for 
a G T, h{A) = A for A e A. Let P' = ^ ^(^ 2 ) 

a I a e P|. Then M{G') = M{G). □ 

Lemma 3.3. The multiset grammars can he transformed into equivalent multiset 
grammars with productions in normal forms. 

Proof. Since -I- is commutative, mREG=mLIN=mCF. 

Regular multiset grammars can be transformed into equivalent ones with 
productions of the forms A — > P -|- a, A ^ a, and S ^ 0 in which case S does 
not appear on any right hand side of a production. 

Let G = (A, P, (SjjP) a regular multiset grammar. If P ^ 0 G P with 
P yf S then construct a new regular multiset grammar G' = (A, P, |S},P') 
with productions P' = (P — |P ^ 0}) U {A ^a|A— >P-|-aG P}. Then 
M(G') = M(G). Repeat this procedure. Since |{P— >0 gP}|<oo this proce- 
dure terminates, either without any erasing production or with the only erasing 
production S ^ 0. 

For a monotone multiset grammar add new symbols 
|(P, A) I A G A} U {{Gri, A) I r G P, 1 < f < m{r),A G A} 

U{(Gh, Ark) \ r G P,l <i,k < m(r), A G A} 

U{Gri, A) I r G P, 1 < f < n{r)} U {Gri, Ark) \ r G P,1 < i,k < n(r)|, 
and if r : u ^ v G P with u = A^i -I- • • • -I- Arm{r), v = -Bri -I- • • • -I- Bmir), 
m(r) < n(r) is a production replace it by new productions 
(P,A)^(G,i,A), 

{Crij ^ (C^r,z+l5 + ^ri (1 ^ ^ < m(r)), 

{Crm{r) ? “1“ ^ (^rl ? “1“ ^rm(r) •> 

{Cfk j ^ (C^r,fc+1 ) (^7^ Th(t ) ) , 

{Crii -^rk') ^ -^ri {k <C. i <i 777'(r)), 

{Crm(r)') -^m(r) ^ (G^rl j ) j 

{Grj, A) + Arj (Gr-^j+i, A) Bj (1 < j < m(r)), 
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{^rj } -^rk) ^rj -(G r,j + l) ^rfc) + Bj (1 < J < k), 

{.Grki^rk) ^ {G r^k+1 : B y 

(Grj,A) (Gr,j+ 1 ,A) + Bj (m(r) < j < n{r)), 

{G rn{r) 5 A) {R,A) + Br,(r), 

{R,A)^A, 

{R, A) + B ^ {R,B) + A for A,B eN. 

Note that this also works for m(r) = 1. 

In the next step, if A ^ _B is a production, replace on all right hand sides of 
productions Ahy A ov B, and remove A ^ B. 

Then all productions are of the forms A + B G + D, A B + G, or A a, 

and the generated multiset languages are identical. 

For an arbitrary multiset grammar add new symbols 

{R, A} U {Cri \ r € P,l < i < m{r)} U {Eri \ r G P,1 < i < m{r)} 

\j{Grj \ r G P,l < j < n{r)} 

and a r : u ^ V G P, with u = A\ + • ■ • + ^m(r)) v = Bi + ■ ■ ■ + B„,(r) is a 
production, replace it by new productions 
R^Grl, 

Gri + Ai — > Gj-j+i + Aj-i (1 < z < m(r)), 

Grm(r) P ^m(r) ^ “t” 

Eri + Ari Erj +1 + A (i < i < m(r)), 

Brm{r) P ^rm(r) ^ G7.1 + Ay 

Grj ^ GrJ-jl P Bj (1 ^ J < 7z(r)), 

Grn{r) ^ B P -^n(r)? 
yl ^ 0, i? ^ 0. 

Note that this also works for m(r) = 1. 

In the next step, if A — > B is a production, replace on all right hand sides of 
productions A by A or B, and remove A B. 

Then all productions are of the forms A + B ^ G + D, A ^ B + G, A a, 
or A ^ 0, respectively, and the generated multiset languages are identical. □ 

Lemma 3.4. mREG = sLin. 

Proof, a) mREG C sLin is Parikh’s theorem [2], Theorem 5.2.1 since 
mREG = PsCF. 

b) Let M = Mi G sLin with Mi = {c* + ' Pij I % ^ 

Construct the regular multiset grammar G = {N, T, S, P) with P consisting 
of the productions S ^ Si, Si ^ Ai + Ci, Ai ^ Ai+pij, Ai ^ 0 where 1 < z < s 
and 1 < j < m(z). 

Then M = M{G), and therefore sLin C mREG. □ 

Lemma 3.5. {Normal form o/mREG) Any M e mREG can be characterized 
in the normal form M = + B®) with multisets Ci and finite sets of 

multisets Pi. 

Thus the (B-height of any M G mREG is at most 1. 

Proof. This follows immediately from Lemma 3.4 and the remark for the 
definition of semilinear sets. □ 
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4 Finite Multiset Automata 

Definition 4.1. (Finite Multiset Automaton) A Finite Multiset Automaton 
(FMA) is a quintuple A = {V,Q,qo,QF,S) with a finite alphabet V, a finite 
set of states Q, an initial state qo, a set of terminal states Qf C Q, and a finite 
set of instructions S C Q x F® x Q. 

The automaton works in such a way that if /r € F®, ^ G F® with ^ > 0, 
& S, fj, — ^ > 0, then fj, G F® is reduced to /r' = /r — 

A accepts a multiset /x if /i is reduced in finitely many steps to 0, and A is 
in a terminal state. The multiset language M{A), accepted by a FMA A consits 
of all accepted multisets. 

Note that due to the commutativity of the basic operation + for multisets 
the application of instructions {q, a, q') and (g, b, q") is non-deterministic. 

But a FMA is called deterministic (DFMA) if (5 is a function S : Q xV ^ Q. 
Otherwise it is called non-deterministic. 

As for classical finite automata a number of theorems can be shown to bring 
a FMA to a simple form. 

Lemma 4.1. To any FMA A = (F, Q, q^, Qp, S) there exists an equivalent FMA 
A' = (F, Q', qo^Qp, S') with S' C Q' x {V U {0}) x Q' . 

Proof. As for classical FA on words replace (p, f,q) G S with f 
{ k>l, Uj e V) by a series p,ao,qi), {qj,aj,qj+i (1 < j < k), (qk,ak,q), all in 
S', and add {qj | 1 < j < A:} to Q. Then M{A') = M{A). □ 

Lemma 4.2. {Removal of 0) To any FMA A = {V,Q,qo,Qp,S) an equivalent 
FMA A' = {V,Q,qo,Q'p,S') can be constructed, with S' C Q x V x Q. 

Proof. Construct a relation A — 0 as follows: 

Mo = {{p,q) e Q X Q \ {p, 0, g) G <5} U {(g, q) q G Q}, 

Mi+i = MiU {{p,q) G Q X Q \ 3q G Mi : {p, 0, q) G 5}. 

Since |Q x Q| < oo there exists a k such that Mk +j = Mk for j > 0. Define 
Aq = Mfc. Actually, Aq = Mg. 

Construct S' by {q,q,q') G S' iff there exist p,p' G Q with (q,p) G Aq, 
{p',q') G Ao, {p,a,p') G S. 

If {do,qf) G Ag with qf G Qp let qo G Q'p. 

Then M{A') = M{A). □ 

Lemma 4.3. (Deterministic FMA) To any FMA A = {V,Q,qo,Qp,S) there 
exists an equivalent complete DFMA A' = {V,Q',q'o,Q'p,S'). 

Proof. As for FA on words, using Lemma 4.2, construct the powerset au- 
tomaton with Q' = V{Q) by S'{Qi,a) = {q 2 G Q \ 3qi G Qi : ( 91 , 0 , 92 ) G <5}, 

q'o = { 90 }, Q'p = {Qi G P{Q) \ QiPQp ^ 0}. 

Let Q' consist of those Qi G V{Q) which are reachable from q'o, joined 
with {0}. If there is no Q 2 with S'{Q\,b) = Q 2 define S'{Q\,b) = 0, as well as 
S'{%, a) = 0 for all a gV. 

Then A' is a complete DFMA with M{A') = M{A). □ 
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Lemma 4.4. The family of multiset languages accepted by FMA is identical to 

mREG. 

Proof, a) Let A = (V,Q,qo,QF,d) be a FMA. By Lemma 4.3 it can be 
assumed that A is a complete DMFA. Construct a regular multiset grammar 
G = {N, T, {5"}, P) as follows: 

Let N = QU {S"} and T = V. For S{q, a) = q' let q ^ q' + a € P {q ^ qo), 
and if S(qo, a) = q' let S' ^ q' + a & P.li qo & Qp let also S' ^ 0 G P. 

Then M(G) = M(A). 

b) Let G = (N, T, S, P) be a regular multiset grammar in normal form. Construct 
a FMA A = (y, Q, qo,Qp, d) as follows: 

Q = N U {qf} and V = T. For A — > i? + a G P let d(A,a) = B, and for 
A ^ a G P let (5(A, a) = g/. If S' — > 0 G P let S' G Qf- 

Then M(A) = M{G). □ 

5 Abstract Families of Multiset Languages 

In this sections some theorems on abstract families of multiset languages are 
presented. This is done in a similar way as for abstract families of word languages. 

Definition 5.1. A homomorphism h : V® is a mapping with properties 

h{0) = 0 and + ^ 2 ) = h{p,i) + h{pL 2 ). 

A homomorphism is called 0-free if h{a) yf 0 for all a G Vi. 

Definition 5.2. A substitution a : V® 2 ^ 2 ® is a mapping with properties 
cr(0) = {0} and a{p,i + ^ 2 ) = cr(/ri) + ct(^ 2 )- 

A substitution is called 0-free if 0 ^ a (a) for all a € Vi. 

Definition 5.3. Let M C T® be a multiset language and h : T ^ T® a 
homomorphism with V/i G M : |/x| < k\h{pi)\. Then h is called k-linear erasing 
with respect to M. 

A family M of multiset languages is closed under linear erasing if h{M) G M 
for all M G A4 and k-linear erasing homomorphisms h with respect to M. 

The proofs of the next theorems follow those for abstract families of (word) 
languages in [Sal]. 

Definition 5.4. Let M C (T + {0, c, • • • , with c ^ T, and h : TU {c} 

T® a homomorphism defined by h{c) = 0 and h{a) = a for a € T. 

Then h is called k-restricted on M. Obviously, V/x G M : j^j < k\h{p.\). Thus 
h is also k-linear erasing on M. 

A family A4 of multiset languages is closed under restricted homomorphism 
if h{M) G A4 for all M G A4 and fc-restricted homomorphisms h on M. 

Theorem 5.1. If a family A4 of multiset languages is closed under 0-free substi- 
tutions, restricted homomorphisms, union with regular multiset languages, and 
intersection with regular multiset languages, then A4 is closed under inverse ho- 
momorphisms. 
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Proof. Let M C T®, Ti = {ai, • • • , a„}, and h : T ^ T® be a homomorphism 
with h{ai) = ai G T® (1 < i < n). 

a) Assume 0 ^ M G M. Let k = max{\ai\ \ 1 < f < n} + 1. Define the set 
T = {oi, • • • , oA}, and the regular substitution a by a{a) = {a} + T . 

Let Ml = {ai + Oi | 1 < i < n}, a finite set, and M 2 = a{M) n M®. Then, 
by the closure properties, M G Ai. 

Define a 0-free homomorphism hi : TUT ^ TU{c} by hi(a) = a, hi{a) = c 
for a GT. Then hi{M 2 ) C (T U {c})® and hi{M 2 ) G Ai. 

Define another homomorphism ft -2 : TU {c} —> T by / 12 (c) = 0 and / 12 (a) = a 
for a G T. h 2 is /c-restricted on hi{M 2 ) since with j3 = J2JLi(^i(j) + 
follows hi(P) = hi(J2jLi(ai(jj+a^(^j))) = E^i(oi(iy+c'“'<^’'), and /i2(/ii(/3)) = 
J2jLi Therefore \hi{P)\ < k ■ m = k ■ \h{hi{(3))\. 

Furthermore, /i 2 (^i(-^ 2 )) = h~^{M), since 
(3 G h~'^{M) h{(3) = X)”Li ai(j) EjLi(ai(T + «hi)) ^ ^2 

^ EjLi e h2{hi{M)). 

Thus h~^{M) G Ai. 

b) 0 G M. 

Then M— {0} = Mn(T+T®) G Ai, and therefore h~^{M — {0}) G Ai. If h is 
0-free then /i”^(0) = {0}, a regular multiset language. Otherwise, h~^(0) = T® 
for some T 2 C T, also a regular mulriset language. Now, 

h~^{M) = h~^{M — {0}) U /i“^(0). Hence h~^{M) G Ai. □ 

Theorem 5.2. If a family Ai of multiset languages contains mREG, is closed 
under intersection with regular multiset languages and substitutions (0-free sub- 
stitutions), then Ai is closed under iteration ®. 

Proof. Ai contains {c}®. If M G A4, then with the substitution (0-free 
substitution) <t(c) = M also M® G Ai. 

Note that actually closure under intersection with regular multiset languages 
is not needed. □ 

6 Closure Properties 

Lemma 6.1. mREG, MON and ARB are closed under union. 

Proof. Let Gi = {Ni,Ti, Si, Pi) with i = 1,2 be two multiset grammars in 
separated form, generating M{Gi). 

Construct G = {Ni U A 2 U {S'}, Ti U T 2 , S, Pi U E 2 U (S ^ Si, S ^ S 2 } where 
P 2 denotes the set of productions P 2 with every nonterminal x from N 2 changed 
into X. Then G is also in separated form and of the same type as Gi. 

Obviously, M{G) = M{Gi) U M{G 2 ). □ 

Lemma 6.2. mREG, MON and ARB are closed under the operation -I-, 

Proof. Let Gi = {Ni,Ti, Si, Pi) with i = 1,2 be two multiset grammars in 
separated form, generating M{Gi). 
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Construct G = {Ni U N2LI {S'}, Ti U T2, S, Pi U V2 U (S ^ Si + S2} with P2 
as above. Then G is also in separated form and of the same type as Gi. 

Obviously, M{G) = M{G\) + M{G2)- □ 

Lemma 6 . 3 . mREG, MON and ARB are closed under intersection. 

Proof. Let Gi = {Ni,Ti, Si, Pi) with i = 1,2 be two multiset grammars in 
separated and normal form, generating M{Gi). 

For multiset languages from mCF this is [2], Theorem 5.6.1, since mCF= 

mREG. 

For multiset languages from mMON construct a new grammar G={N,T,S,P) 
with A = (Vi U {•}) X (V2 U {•}), T = Ti n T2, S = (^^), and P as follows: 

If B + G e Pi then put (^) ^ (|) + (^), (^) + (*) ^ (^) + (^), into 

P, with E, F G V2 U {•}. 

If A^B + GgP, then put Q ^ (|) + (*), (^) + Q ^ Q + (^) into 
P, with P, P G Vi U {•}. 

If A + B^G+Dg Pi then put (^) + (^) ^ (^) + (^) into P, with 
E, F (z: V2 U {•}. 

If A + B^G+D£P2 then put (^) + (^) ^ (^) + (^) into P, with 
P,PG FiU{.}. 

If A ^ a G Pi then put (^) ^ (|.) into P, with P G V2 U {•}. 

If A ^ a G P 2 then put (^) ^ (^) into P, with P G Vi U {•}. 

Put also into P the productions (^) + (^) ^ (^) + (^) , with A,B G Pi U {•}, 
P,P G P2 U {•}, S ^ (fi), (•) ^ •, and (“) ^ a, with a G P. 

In any derivation of some /i G P® at most |/i| — I *’s are generated in the 
upper part, as well in the lower part, namely only by productions A — > P + G. 

Consider the generated multiset language M{G) C (PU (•})®. By definition 
M{G) G mMON. The homomorphism h : TU {•} ^ P defined by h{») = 0 
and h{a) = a for a G P is a linear erasing homomorphism since |/x| < 2|/i(/i)|. 

By Theorem 6.1, h{M{G)) G mMON, and trivially h{M{G)) = M{Gi) C 
M(G2). 

For languages from mARB construct 

G = (P{ U P2 u {S'}, Pi n P2, S, U P2 U {S ^ S{ + S2} U {P + X ^ X I a; G 
PinP2}). 

Then M(G) = M(Gi)nM(G2). □ 

Corollary 6 . 1 . mREG, MON and ARB are closed under intersection with 
regular multiset languages. 

Lemma 6 . 4 . mREG and ARB are closed under substitution a, mMON is 
closed under 0-free substitutions. 

Proof. Let G = (A, P, S, P) be a multiset grammars in separated form, gen- 
erating M{G). 

Let cr(a) be generated by a multiset grammar G„ = {Na,Ta, Sa, Pa) of the 
same type as G, in separated form, with A„ C A;, = 0 for a yf 6 and A C A^ = 0 
for a GT. 
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Construct a new multiset grammar G" = {N' ,T' , S, P') with N' = N U 
UaeT^a, r = [JaGTTa, and P' = (P - {A ^ o | a G T}) U {A ^ S', | a G 
T} U UaGT 

Then, in all cases, M{G') = a{M{G)). □ 

Corollary 6.2. mREG and ARB are closed under homomorphisms, mMON 
is closed under 0-free homomorphisms. 

Theorem 6.1. mMON is closed under linear erasing, and therefore also under 
k-restricted homomorphisms. 

Proof. Let M G mMON, h a fc-linear erasing with respect to M with h : 
T T® . Assume k > 1, since otherwise only M = 0orM={O} are possible. 
Let h{M) C T®. Trivially, h{M) G mARB since mMON is closed under 
homomorphisms. Let G = (V, T, S, P) be a with M{G) = M in separated form. 
Then the grammar Gi = (V, Pi, S, Pi) with Pi = (P — {A ^ a \ a G T}) U {A ^ 
h{a) I a G P} generates h{M) 

Now, let p .1 G h{M). Then there exists a, p, G M with h{p) = p\. Since 
h is ^-linear erasing on M, \p\ < k ■ \h{p)\ = k ■ \pi\, and therefore it follows 
thatlLS(/xi) = max{W S{p), |/ii|)} = max{\p\, |/ii|} < k • \pi\. 

Hence, by the workspace theorem, h{M) G mMON. □ 

Lemma 6.5. mREG, mMON, and mARB are closed under inverse homo- 
morphisms. 

Proof. This follows from Lemmata 6.1, 6.4, Corollary 6.1, 6.2, and Theorems 
6.1, 5.1. □ 

Lemma 6.6. mREG, mMON, and mARB are closed under iteration ®. 

Proof. This follows from Corollary 6.1, Lemma 6.4, and Theorem 5.2. □ 

Lemma 6.7. mREG is closed under set difference, i.e., if Mi, M 2 G mREG 
then Ml — M 2 G mREG. 

Proof. This is [2], Theorem 5.6.2. □ 

Corollary 6.3. mREG is closed under complement, i.e., if M G mREG then 
T® - M G mREG. 

Proof. Since trivially P® G mREG this is a consequence from Lemma 6.7. 

□ 

For two multiset languages let Mi A M 2 = {pi C ^2 | Mi G Mi, M 2 G M 2 } and 

Ml V M 2 = {mi U M 2 I Ml C Ml, M 2 C M 2 }. 

Lemma 6.8. {Closure properties of PsCS = PsMON) PsCS is closed under 
union, +, iteration ®, intersection, complement, set difference, 0-free homo- 
morphisms, and substitutiona, inverse homomorphisms, and V, but not under 
homomorphisms and substitutions. 

Proof. Let tp{u) denote the Parikh vector of a word u. Since GS is closed under 
U, •, *, and if {A UB)= if {A) U if{B), if {A ■ B) = if {A) + if{B), if {A*) = if {A)®, 
PsGS is closed under U, +, and ®. 
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PsCS is closed under n, since to any L G CS another language L' G CS 
with tp(L') = 'ip(L) can be constructed by adding productions UiGj ajUi to 
the productions of the grammar for L. Then L” = L' O {oi}* • • • {a„}* G CS 
and all v G L' have the form v = al' ■ ■ ■ where T = Thus 

to each u G L there exists a unique 4>{u) G L" with 4’{u) = i/'(</>(u)) being a 
representation of By this one gets a representation of as 4>{L) G CS. 
Let Li,L 2 G CS. Then (/)(Li), ^(^2) € CS are representations of 'tp{Li),ip{L 2 ). 
Since n 4'{L2) G CS it follows immediately that 'ip(Li) fl ip{L 2 ) G PsCS. 

PsCS is closed under complement. Now L G CS implies tt{L) G CS where 
tt{L) = {u G T* \3v G L : tp{v) = i.e., 7 t(L) consists of all permutations 

of words in L. Trivially, Since CS is closed under complement, 

T* - 7 t(L) G CS, and therefore T® - = HT* ~ 7 t(L)) G PsCS. 

Because of Mi — M 2 = Mi n (T® — M») it follows that PsCS is also closed 
under set difference. 

Let cr(ai) = Xi G PsCS be a 0-free multiset substitution. Then there exist 
Yi G CS with = Xi. Define a A-free word substitution r(oi) = Yi G CS. 

Then 'ip{T{ai)) = tp(Yi) = Xi = <j{ai). Therefore, with u = xi - • ■ Xm follows 

m mm 

V'(r(w)) = • •r(x™)) = ^ = XI • 

i=i i=i i=i 

With M = ij}{L) and since t(L) G CS it follows that a{M) = a{ip{L)) = 
V'(t(L)). 

By Theorem 5.1, PsCS is also closed under inverse homomorphisms. 

PsCS is closed under V. Let Mi, M 2 G PsCS with Mj = Lj G CS, 

Lj C T* , and T = {ai,---,a„}. Then also 4>{Lj) G CS with defined as 

above. 

Assume 0 ^ Mj. Let g(ai) = bi, h{ai) = Cj, and Ti = g{T), T 2 = h{T). Then 
L' = ag{Li)bh{L 2 )c G CS with {a, b, c} n (T U Ti U T2 = 0. V consists of words 

Extend a LBA accepting L' by additional instructions checking successively 
for each i whether > Si or Vi < Si. In the first case it changes all bi into di and 
all Ci into Cj, in the second case all bi into e* and all Ci into di. The resulting LBA 
accepts a language L” C {a}T^ {b }^3 {c} where T3 = {di, • • • , d„}U{ei, • • • , e„}. 

Define a homomorphism / by /(d*) = a*, /(e*) = /(a) = f{b) = /(c) = A. / 
is fc-limited erasing on L" since 1^1 <2- |/(u)| + 3 < 5- |/(■u)|. Then f{L") G CS, 
and ip{f{L")) = V’(Li) V 

If 0 G Ml or 0 G M 2 then Mi V M 2 = {M[ V M^) U {0} G PsCS where 
0 ^ nM^. 

PsCS is not closed under homomorphisms and substitutions. 

Consider M = ijj{L) G PsRE with L G RE, L G T*, M ^ PsCS. Then 
there is a effectively constructable L' G CS with L C T*, Ti = T U {A}, and a 
homomorphism h with h{x) = x for x G T, h{A) = A, and L = h{L'). 

Consider (j){L') G CS with 0 as above such that \j}{4>{L')) = ip{L') G PsCS. 
Then = ip{L) = h{ip{L')) = M. But then closure of PsCS under 

homomorphism would imply M G PsCS, a contradiction. 

Therefore PsCS is not closed under homomorphism and substitution. □ 
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Lemma 6.9. {Closure properties of PsRE) PsRE is closed under union, +, it- 
eration® , intersection, homomorphisms, substitutions, inverse homomorphisms, 
V, A, but not under complement and set difference. 

Proof. Let if{u) denote the Parikh vector of a word u. Since RE is closed 
under U, •, *, and ip{A [J B) = ip{A) U f>{B), 'if{A ■ B) = 'ip{A) + if{B), f>{A*) = 
'ip{A)®, PsRE is closed under U, +, and ®. 

PsRE is closed under n, since to any L G RE another language L' G RE 
with ip{L') = 'ip{L) can be constructed by adding productions UiOj ajOi to 
the productions of the grammar for L. Then L” = L' O {oi}* • • • {a„}* G RE 
and all v G L' have the form v = where T = Thus 

to each u G L there exists a unique 4>{u) G L" with f’{u) = i!}{4>{u)) being a 
representation of 'if{u). By this one gets a representation of '4>{L) as (j){L) G RE. 
Let Li,L 2 G re. Then 4>(Li) , (f(L 2 ) € RE are representations of 'ip{Li),'if{L 2 ). 
Since 4>{Li) n (f>{L 2 ) G RE it follows immediately that tp{Li) fl V'(^ 2 ) G PsRE. 

PsRE is of the power of Turing machines since any word u can be encoded 
as a multiset /i(w), even as p.{u) G N such that any L G RE can be represented 
by a multiset language p,{L) G PsRE. Therefore PsRE is not closed under set 
difference — and complement “. 

Let a{ai) = Xi G PsRE be a multiset substitution. Then there exist mul- 
tisets Yi G RE with 4’{Yi) = Xi. Define a word substitution T{ai) = Yi G RE. 
This gives ip{T{ai)) = if{Yi) = Xi = a{ai). With u = xi ■ ■ ■ Xm follows 

m mm 

V’(r(u)) = V’(t(xi) • • • T{xm)) = f’irixj)) = Y = ^(Y = ct(V'(m)). 

j=i j=i j=i 

With M = '4>{L) and since t{L) G RE it follows that a{M) = a{ip{L)) = 
V'(r(L)). 

Thus PsRE is also closed under homomorphisms. 

By Theorem 5.1 PsRE is also closed under inverse homomorphisms. 

PsRE is closed under V. Let Mi, M 2 G PsCS with Mj = il}{Lj), Lj G RE, 
Lj C T* , and T = Then also <f{Lj) G CS with 4>{Lj) defined as 

above. 

Let g{ai) = bi, h{ai) = Cj, and T\ = g{T), T 2 = h{T). With new symbols 
{a, b, c} n (r U Ti U T 2 ) = 0, it follows L' = ag{Li)bh{L 2 )c G RE. V consists of 
words ■ ■ ■ ¥^bCf ■ ■ ■ df^c. 

Extend a TM accepting L' by additional instructions checking successively 
for each i whether ri > Si or ri < Si. In the first case it changes all bi into di and 
all Ci into Cj, in the second case all bi into Cj and all Ci into di. The resulting TM 
accepts a language L” C {a}T{l {b}T{l {c} where T 3 = {di, • • • , d„}U{ei, • • • , e„}. 

Define a homomorphism / by f{di) = Oj, f{e,) = f{a) = f{b) = /(c) = A. 
Note that the result can also be A. Then f{L") G RE, and ip{f{L")) = '4){L{) V 
tp{L2). 

By a similar argument PsRE is also closed under A. □ 

Lemma 6.10. To each M G mARB there effectively exist a M' G mMON 
and a homomorphism h such that M = h{M'). 
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Proof. Consider a multiset grammar G = {N, T,{S},P) with generated lan- 
guage M{G) = M G mARB. Replace any production A^OsPbyA^yl 
with A ^ N U T, generating a multiset language M' G mMON. Defining the 
homomorphism h by h(x) = x for x G T, h{A) = A, it follows that M = h{M'). 

□ 

Lemma 6.11. mMON is not closed under homomorphisms and substitutions. 

Proof. This follows immediately from Lemma 6.11. □ 

The results so far obtained for closure properties of multiset languages are 
given in the following table where also the families PsCS and PsRE of multisets 
are added. 

Theorem 6.2. The previous table shows the known closure properties of families 
of multiset languages. 

The places not filled are open problems. 
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Abstract. Database systems cope with the management of large groups 
of persistent data in a shared, reliable, effective, and efficient way. Within 
a database, a multiset (or bag) is a collection of elements of the same 
type that may contain duplicates. There exists a tight coupling between 
databases and multisets. First, a large variety of data models explicitly 
support multiset constructors. Second, commercial relational database 
systems, even if founded on a formal data model which is set-oriented 
in nature, allows for the multiset-oriented manipulation of tables. Third, 
multiset processing in databases may be dictated by efficiency reasons, 
as the cost of duplicate removal may turn out to be prohibitive. Finally, 
even in a pure set-oriented conceptual framework, multiset processing 
may turn out to be appropriate for optimization of query evaluation. 
The mismatch between the relational model and standardized relational 
query languages has led researchers to provide a foundation to the ma- 
nipulation of multisets. Other research has focused on extending the rela- 
tional model by relaxing the first normal form assumption, giving rise to 
the notion of a nested relation and to a corresponding nested relational 
algebra. These two research streams have been integrated within the con- 
cept of a complex relation, where different types of constructors other 
than relation coexist, such as multiset and list. Several other database re- 
search areas cope with multiset processing, including view maintenance, 
data warehousing, and web information discovery. 



1 Introduction 

There exists a tight coupling between databases and multisets. A database is a 
collection of persistent information that is managed by a software package called 
database management system (DBMS) or, simply, database system. A database 
may be very large in nature. Consequently, a DBMS is required to manage infor- 
mation not only in main memory but, more specifically, in mass memory. Usually, 
a database is shared, that is, different applications and users are permitted to 
access the same database concurrently. This way, both redundancy and incon- 
sistency of data can be avoided. A DBMS provides several other capabilities, 
including support for recovery and privacy of data. 

A database is defined according to a data model, which provides means of 
data organization based on certain design patterns. A data model supports struc- 
turing constructs that are similar to the type constructors of general-purpose 
programming languages, like record, array, set, and file. 



C.S. Calude et al. (Eds.): Multiset Processing, LNCS 2235, pp. 147—215, 2001. 
(c) Springer- Verlag Berlin Heidelberg 2001 
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Throughout the history of database technology, several data models have 
been defined, including the hierarchical data model, the network data model, the 
relational data model, and the object-oriented data model. All of them are called 
logical data models, to highlight that the relevant data structures refiect a specific 
organization, such as a tree, a graph, a table, or an object. Usually, a database 
is first described based on a conceptual data model, which is independent of the 
specific organization of data, such as the entity-relationship data model. Such a 
conceptual model is then mapped to a logical data model. 

A database consists of a schema and an instance. The database schema de- 
fines the structure of data, which is expected to be essentially invariant with 
time. The database instance is the actual value of data, which is instead time- 
varying. The database changes its state when data are modified either by deleting 
or updating existing information, or by inserting new information. 

The content of a database can be defined, accessed, and manipulated by 
means of a database language, which usually provides both data definition and 
data manipulation constructs. By ‘data definition’ we mean specifying the data- 
base schema. By ‘data manipulation’ we mean either querying or changing the 
database instance. For the DBMS to be usable, it is expected to retrieve and 
modify data efficiently, whatever the data model. 

To simplify user interaction with the system, the same information can be 
organized at different abstraction levels. Specifically, the physical level describes 
how the data are stored in persistent memory in terms of complex low-level data 
structures. The logical level describes what data are incorporated within the 
database and what relationships exist among such data. Finally, the view level 
describes what part of the database can be seen by each class of users, as several 
views can be provided for the same database. 

For the different abstraction levels to work as whole, appropriate mappings 
from the view level to the logical level, and from the logical level to the physical 
level, are required. Considering a relational DBMS, a view is mapped to one or 
several relations (tables) of the logical level, while a relation is mapped to one 
or several low-level data structures of the physical leveb. 

Drawing now our attention to why multisets and databases are related to 
each other, we observe that, first, several DBMS data models support multi- 
set constructors [15, 47, 28, 45, 36, 44, 23, 12]. Consequently, database query 
languages are affected by the use of duplicates. 

Second, there exist practical multiset-oriented query languages that have 
been implemented based on previous formalization of some set-oriented lan- 
guages. Considering relational database technology, relational DBMSs have been 
developed upon the mathematical notion of a relation and the formalization of 
set-oriented query languages, namely relational algebra and relational calculus. 
However, relational DBMSs, almost invariably extend the nature of relations to 
multisets and support query languages with additional expressive power. 



^ The physical level incorporates in general a variety of additional data structures that 
are invisible at the logical level, such as, for example, the index of a relation, which 
allows for fast query evaluation. 
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Finally, dealing with multisets rather than with sets is generally bound to 
speed up query evaluation, as there is no need for duplicate elimination, an 
operation which can be extremely costly in large databases. Note that, in this 
case, multiset processing may be confined to optimization issues within the query 
evaluation, even in set-oriented queries, where duplicates are removed only when 
the set-oriented semantics of a query would be otherwise violated. 

The mismatch between the relational model and the standardized relational 
query languages has led researchers to provide a foundation to the manipulation 
of multisets. Other research has focused on extending the relational model by 
relaxing the first normal form assumption (which requires flat tables), giving 
rise to the notion of a nested relation and to a corresponding nested relational 
algebra. These two research streams have been integrated within the concept 
of a complex relation, which is polymorphic in nature, as different types of 
constructors other than relation coexist, such as multiset and list. 

Several other database research areas cope with multiset processing, includ- 
ing view maintenance, data warehousing, and web information discovery. 

The remainder of the paper is organized as follows. Section 2 introduces 
the relational model in a pure set-oriented framework. Section 3 discusses how 
multisets can be manipulated in relational database systems. Section 4 presents 
the nested relational model, where sets can be nested within each other to form 
nested relations. Section 5 provides some hints on how the nested relational 
model can cope with multisets, thereby yielding the notion of a complex relation. 
A discussion on related work is provided in Sect. 6. Conclusions are drawn in 
Sect 7. 

2 Relational Model 

Most database systems currently on the market are based on the relational 
paradigm, which was proposed by Codd in 1970 [16], with the main purpose 
of overcoming the typical flaws of the database systems of that period, among 
which was the inability to provide data independence, that is, the separation 
between the logical view of the model from its physical implementation. 

However, the success of the relational approach was quite slow. Its high level 
of abstraction prevented for several years the implementation of efficient struc- 
tures, as they were significantly different from those used in that epoch, typically, 
relevant to hierarchical and reticular data models. In fact, even though the first 
prototypes of relational systems have been developed since the early 1970s, the 
first relational systems appeared on the market in 1981, and became significant 
only in the middle 1980s. 

The relational data model makes use of a single structure to organize data: the 
mathematical concept of n-ary relation. Beside being easy to formalize, the rela- 
tional model has a simple representation: a relation is a table, and the database 
is viewed as a collection of tables. Another advantage of the relational model 
with respect to previous data models is that it responds to the requirement of 
data independence. 
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Intuitively, a table, which has a unique name in the database, contains one 
or more columns: each column has a heading, called attribute name, and a 
corresponding set of possible values, called the domain (for example, integers, 
strings, etc.). Each row is an ordered n-tuple of values {vi,V 2 ,-- - ,v„), each 
belonging to the domain of the corresponding attribute: each Vi is in the domain 
of column i G [1 .. n], where n is the number of columns. A table is therefore an 
unordered collection of distinct tuples. 

Formally, let U be the set of attribute names, U = {Ai,... ,A„}, and D 
the set of domains, D = {Di, . . . , Zl„}. Each domain Di contains only atomic 
values. We assume the existence of a function Dom : U i— > D, which associates 
the appropriate domain with each attribute name. Then, a tuple over a set of 
attributes A C U is a function t that associates with each attribute Ai G X a 
value of the domain Dom(Ai); this value is denoted by t(A.y We shall also write 
t(^Y) with Y Q X to denote the restriction of the function t to the attributes in 

r. 

A relation schema of a relational database has the form R = (Ai, . . . A„), 
where R is the relation name and Ai are distinct attribute names. A database 
schema, denoted by A, is a set of relation schemas with distinct relation names. 

A relation instance (or simply, a relation) defined on a relation schema R = 
(Ai, . . . , A„), is a finite set r of tuples over X = {Ai, . . . , A„}. 

A database instance 5 on a database schema Z\ is a set of relations {ri , . . . , r^} 
where each ri is defined on precisely one Ri vci A. 

With respect to the tabular representation of relation, the above definitions 
lead to the following observable properties: 

1. The values within each column are homogeneous, and they all belong to the 
same domain. The domain corresponds to the attribute of the column. 

2. There are no identical rows: a relation is a set, and therefore it does not 
contain duplicated elements. 

3. The order of columns is irrelevant, since they are identified by their name. 

4. The order of rows is immaterial, since they are identified by their content. 

The relational data model is value- oriented, as all the information are represented 
by means of values: the identity of a tuple in a relation is based only on the values 
it determines for the attributes, and the relationship among tuples in different 
relations is based only on attribute values. This is particularly important since 
not all aspects of a relation might be known at a given time: in this case the 
relation could contain values which are not specified. The domains of attributes 
can be extended by including a special null value, that represents the absence of 
information. The theory of null values is especially important for the relational 
model, where all the information is value-based. 

Moreover, part of the content of a relation is inherent to constraints on the 
values that some or all of its attributes can assume. Two important classes of 
such constraints are expressed by the notions of a key and functional dependency. 

A subset K of the attributes of a relation r is a key of r when the following 
properties hold: 
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Table 1. Relation Courses (course, year, teacher) 



course 


year 


teacher 


Algebra 


1 


Sheila 


Calculus 


1 


Gregory 


Compilers 


5 


Jerri 


Computers 


2 


Patricia 


Databases 


3 


Carol 


Geometry 


2 


Martin 


Operating systems 


4 


Angelique 


Programming langnages 


3 


Jerri 


Robotics 


5 


Richard 


Software engineering 


4 


Angelique 



1. Unique identification', r does not contain two distinct tuples ti,t 2 which agree 
on all the attributes in K, that is: 

(yti)(yt2){ti(K) ^ i2(K)) ; ( 1 ) 

2. Minimality, no proper subset of K enjoys the unique identification property. 

A set K fulfilling the unique identification property is called a superkey, since it 
is the superset of a key. The set of all the attributes in a relation is always a key 
(since all the tuples are distinct), so each relation has at least one key. 

Given two sets of attributes X and T of a relation r, we say that Y func- 
tionally depends on X in r, denoted hy X ^ Y , if and only if for every pair of 
tuples ti G r, t 2 G r, if ti(x) = ^ 2 (jc), then ti(y) = hir)- 

Intuitively, there is a functional dependency (FD) when the value of one or 
more attributes in a relation determines the value of another group of attributes. 
The concept of a key of a relation can be rephrased in terms of FD: a set K of 
attributes of r is a key if the following properties hold: 

\. K ^ Nk, where N^: is the set of attributes of r that do not belong to K-, 

2. No proper subset of K meets the same property. 

Example 1. Shown in Tables 1 and 2 are relations Courses and Teachers, 
respectively. Specifically, the schema of Courses consists of three attributes, 
namely: 

1. course, defined on the domain of course names, which identifies the course, 
in other words, course is a key for Courses; 

2. year, defined on the range [1..5], which identifies the curriculum year in 
which the course is taught; 

3. teacher, defined on the domain of teacher names, which identifies the name 
of the relevant teacher. 



The instance of Courses includes ten tuples. On the other hand, the schema 
of Teachers involves attributes name, age, and department, where name is a key. 
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Table 2. Relation Teachers (name , age, department) 



ncone 


age 


department 


Angelique 


42 


Computer science 


Carol 


38 


Computer science 


Gregory 


63 


Mathematics 


Jerri 


54 


Computer science 


Martin 


42 


Mathematics 


Patricia 


36 


Electronics 


Richard 


45 


Electronics 


Sheila 


40 


Mathematics 



Note that, in order to avoid dangling teachers, the domain of name in Teachers 
is expected to include the domain of teacher in Courses. Therefore attributes 
teacher and narnie allow us to join each tuple of Courses with one tuple of 
Teachers appropriately. Thus, to find out the department of the teacher of 
a given course, we match the appropriate tuple in Courses with the one in 
Teachers which agree on attributes teacher and name, respectively. □ 



2.1 Relational Algebra 

Once a relational database schema has been instantiated, it is possible to retrieve 
and modify the stored information by means of appropriate languages. Formally, 
updating the database amounts to changing its state. Denoting with S the do- 
main of states of a database schema A, the update language describes functions 
(p between the domain of states, namely ^ : S S. On the other hand, querying 
a database amounts to creating new relations, therefore query languages de- 
scribe functions from S to the set of relations over possible schemas. Relations 
yielded by queries are temporary, that is, they disappear automatically from the 
database after they have been displayed to the user as a side effect of the query 
evaluation. Thus, we distinguish base relations from temporary relations, where 
the former are those relations which instantiate the database schema, while the 
latter correspond to query results. However, temporary relations may be trans- 
formed into base (persistent) relations by special-purpose create statements. 

A great deal of effort in the design of relational database systems has been 
devoted to query languages, among which is relational algebra. Relational alge- 
bra consists of expressions applied to relations. Intuitively, a relational-algebra 
expression follows the pattern of arithmetic expressions, where variables are re- 
placed by relation names, while arithmetic operators, such as -I-, — , *, and /, are 
replaced by relational operators. The application of a relational operator to one 
or two relations yields a new relation, exactly as the application of an arithmetic 
operator gives rise to a new number. Likewise, relational operators may be ap- 
plied to relational expression, exactly as arithmetic operators can be applied to 
arithmetic expressions, for example, {x — y)*{z+w). Note that relational algebra 
is in fact an algebra because the application of relational operators on relations 
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Table 3. Relation Priorities (course , prerequisite) 



course 


prerequisite 


Compilers 


Software engineering 


Computers 


Calculus 


Geometry 


Algebra 


Operating Systems 


Programming languages 


Programming languages 


Computers 


Robotics 


Geometry 


Robotics 


Programming languages 


Software engineering 


Programming languages 


Software engineering 


Databases 



produces relations. In other words, relational expressions are closed with respect 
to the domain of possible relations. 

A query is an expression of relational algebra. Since relations are essentially 
sets, relational algebra deals with the manipulations of sets. However, it is pos- 
sible to extend the semantics of relational operators to multisets, as outlined in 
Sect. 3. As a matter of fact, concrete relational query languages based somehow 
on relational algebra, such as SQL (see Sect. 3.4), allows the manipulation of re- 
lations in a multiset-oriented way, where, for efficiency reasons, duplicate tuples 
are not removed unless explicitly specified. 

Relational-algebra operators include projection, selection, product, join, re- 
naming, union, intersection, and difference. Furthermore, for modularity reasons, 
we introduce the assignment statement. 

Projection. The projection operator is unary: it takes a relation and produces a 
new relation containing only a subset of its columns. Let r be a relation defined 
over the schema R containing the set X of attributes, and let T C A. The 
projection of r onto Y, denoted by iryir), is a relation on the attributes in Y 
consisting of the restrictions of the tuples of r to the attributes in Y : 

T^Y{r)=^{t^Y)\t&r}. ( 2 ) 

Example 2. Shown in Table 3 is a relation called Priorities, whose schema 
includes attributes course and prerequisite. Priorities is meant to spec- 
ify the prerequisites of each course, that is, the courses a student is required 
to have passed before taking the exam of that course. For example. Geometry 
requires Algebra as prerequisite. Software engineering requires two prereq- 
uisites, namely Programming languages and Databases. Note that both at- 
tributes course and prerequisites are part of the key. In fact, given a course, 
there exist in general several different prerequisites for it. On the other hand, a 
course may be a prerequisite for several different courses. As a matter of fact. 
Programming languages is a prerequisite for three courses. This means that 
neither course nor prerequisite is a key by its own, but only the composition 
of both. 
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Table 4. Result of relational-algebra Expression (3) 



course 

Software engineering 

Calculus 

Algebra 

Programming languages 
Computers 
Geometry 
Databases 



To find out the courses which are prerequisites for some courses we write the 
following expression: 

^prerequisite (priorities) ( 3 ) 

which will give the result outlined in Table 4 . Note that, generally speaking, 
some of the tuples may become identical when they are projected on a set of 
attributes: in this case duplicated tuples are deleted, so the resulting relation 
may actually contain less tuples than the operand. In our example, two of the 
three tuples relevant to Programming languages have been eliminated from the 
result, so that the resulting relation includes seven tuples instead of nine. □ 



Selection. The selection operator is unary. Intuitively, the result of the selection 
is the subset of the tuples in the operand that satisfy a selection predicate 
expressed in terms of elementary comparisons of constants and attribute values 
plus logic connectives. 

Propositional formula. Let r be a relation over the set of attributes X; a propo- 
sitional formula p over X is defined recursively as follows. Atoms over X have 
the form AidA2 or Ai'da, where A\ G X, A2 G X, a is a constant, and -d is a 
comparison operator, 1} G {=,<,>,yf,>,<}. Every atom over X is a proposi- 
tional formula over X; if pi, p2 are propositional formulas over X, then ^(pi), 
pi A p2, and pi V p2 are formulas over X. Parentheses can be used as usual. 
Nothing else is a formula. A propositional formula associates a Boolean value 
with each tuple in r. 

Given a relation r over the schema R, the selection of r with respect to p, 
denoted by <Jp{r), is a relation over the same schema R, containing the tuples 
of r that make p true: 

^p(r) {t&r\ p{t)} . ( 4 ) 

Example 3. Considering Table 1 , to find out the course names of the last two 
years we write: 



^course (^year>4 (CourseS ) ) 



( 5 ) 
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Table 5. Result of relational-algebra Expression (5) 



course 
Compilers 
Operating Systems 
Robotics 

Software engineering 



which is expected to produce the result outlined in Table 5. Note that the tem- 
porary result of the selection is the operand of the subsequent projection. □ 

Product. It is a binary operator. Let ri and r2 be two relations defined over 
the set of attributes X and Y, respectively, such that XHY = 0. The product of 
ri and r2, denoted by ri x r2, is a relation on X LI Y consisting of all the tuples 
resulting from the concatenation of tuples in ri with tuples in r2' 

Hpf 

M X T 2 = {t over XLY \ {3ti){3t2){ti G n, ^2 G T 2 , t^x) = h,t(Y) = h)} ■ 

( 6 ) 

Join. It is a binary operator that comes in two versions, usually referred to as 
natural join and theta-join. Let r\ and r2 be two relations defined over the set 
of attributes XY and YZ, such that XY n YZ = Y. The natural join of ri and 
C2, denoted by ri M r2, is a relation on XYZ = XY U YZ consisting of all the 
tuples resulting from the concatenation of tuples in ri with tuples in T2 that 
have identical values for the attributes Y : 

Hpf 

n XI T2 = {t over XYZ \ ( 3 ti)( 3 t 2 ) {h G ri,t2 G r2, t(XY) — tljtiYZ) — t 2 } ■ 

( 7 ) 

Example j. To associate with ech course and corresponding teacher the relevant 
prerequisite we write: 

X course. teacher. prerequisite^^ ours eS X Priorities^ (8) 

which generates the relation outlined in Table 6. □ 

Given two relations ri and r2 over disjoint sets of attributes X and Y, a theta-join 
Xp T2 is a relation over the set of attributes XLY containing tuples obtained 
by the concatenation of tuples of r\ and r2 that satisfy the propositional formula 
p, that is: 

ri Xp T2 {t over XLY \ ( 3 ti)( 3 t 2 ) (ti G ri,t2 G r2, t(x)=ti, t(y)=t2, p(t))} . 

(9) 

The theta-join can be expressed through the use of selection and product as 
follows: 



ri Xp T2 = <Jp(ri X T2) . 



( 10 ) 
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Example 5. To retrieve, for each course, the department the corresponding teacher 
depends on we write: 

^course. department (^C)UrseS l^teacher— name Teachers) (11) 

which yields the relation outlined in Table 7. In this case we used a theta-join 
as the two linking attributes teacher and name have different identifiers. □ 



Renaming. It is a unary operator that only changes the name of the attributes 
in the result, leaving the content of the relation unchanged. It is used to overcome 
difficulties with those operators for which attribute names are significant. Let r 
be a relation defined over a set of attributes X, and Y another set of attributes 
with the same cardinality of X, that is, |X| = |y|. Besides, let Ai,A 2 ,... ,Ak 
and be an order for attributes in X and Y respectively. The 

renaming 



( 12 ) 

is a relation which includes a tuple t' for each tuple t G r, defined as follows: t' is a 
tuple on Y and = t(Ai)> * G [1 •• k]- In practice, only the renamed attributes 



Table 6. Result of relational-algebra Expression (8) 



course 


teacher 


prerequisite 


Compilers 


Jerri 


Software engineering 


Computers 


Patricia 


Calculus 


Geometry 


Martin 


Algebra 


Operating systems 


Angelique 


Programming languages 


Programming languages 


Jerri 


Computers 


Robotics 


Richard 


Geometry 


Robotics 


Richard 


Programming languages 


Software engineering 


Angelique 


Databases 



Table 7. Result of relational-algebra Expression (11) 



course 


department 


Algebra 


Mathematics 


Calculus 


Mathematics 


Compilers 


Computer science 


Computers 


Electronics 


Databases 


Computer science 


Geometry 


Mathematics 


Operating systems 


Computer science 


Programming languages 


Computer science 


Robotics 


Electronics 


Software engineering 


Computer science 
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will be indicated within the two lists Ai, . . . ,Aj~ and A'^t ■ ■ , A'^, namely those 
for which Ai yf A', rather than the complete list of attributes. 

Since the renaming operator is not intended to change the domain of the 
attributes, we also require Dom(yl9 = Dom(Ai). 

Example 6. We may formulate the same operation of Example 5 by replacing 
the theta-join in Expression (11) with a renaming and a natural join as follows: 

^course. department ^ (Pteacher.* — name(l®^^hers))) (13) 

which yields the same relation displayed in Table 7. □ 

It is worthwhile pointing out that a natural join can be expressed in terms of 
projection, theta-join, and renaming operations. Let ri and T 2 be two relations 
over schemas XY and YZ, respectively, where XY n YZ = Y. Then, the following 
equivalence holds: 



ri Nr2 = 7T„,(ri (Pa;,... (?'2)) (14) 

where Y = {Ai,... ,Ak\, Y' = {A'^, . . . ,A'f.}, XY n E'Z = 0, and is a 
predicate defined as follows: 



Py,Y' 




true if = t(v') 
false otherwise . 



(15) 



Set-theoretic operators. Since relations are special classes of sets, the mean- 
ing of the well-known set-theoretic operators can be retained for relations as well. 
Specifically, the union, intersection, and difference of two relations r and s are 
denoted by r U s, r n s, and r — s, respectively. However, since the result is ex- 
pected to be defined over an expected schema, these operators require r and s 
to share the same schema, otherwise the tuples of the resulting relation would 
refer to different sets of attributes. 

Example 7. To find the courses which either depend on some prerequisite courses 
or are themselves prerequisites for other courses we may formulate the following 
expression: 

(^course ies) ) U (Pcourse.^ — prerequisite (^prerequisite ies) ) ) 

(16) 

whose result is displayed in Table 8. Incidentally, all the courses in Courses are 
incorporated in Table 8. Duplicate courses have been removed. 

Instead, if we are to retrieve the courses which are both prerequisite for some 
courses and depend on other courses we have just to change the union operator 
in Formula (16) with an intersection as follows: 

(^course (pylori t ies) ) n (pcourse.^ — prerequisite (^prerequisite (Priorities) ) ) 

( 17 ) 
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Table 8. Result of relational-algebra Expression (16) 



course 

Algebra 

Calculus 

Compilers 

Computers 

Databases 

Geometry 

Operating Systems 

Programming langnages 

Robotics 

Software engineering 



Table 9. Result of relational-algebra Expression (17) 



course 

Computers 

Geometry 

Programming langnages 
Software engineering 



whose result is outlined in Table 9. 

Finally, the following expression yields the courses which require some other 
courses, but are not prerequisite of any course: 

(^course (Priorities)) (pcourse-^ — prerequisite (^prerequisite (Priorities) ) ) 

(18) 

whose result is outlined in Table 10. □ 



Assignment. The assignment operator does not extend the expressive power of 
relational algebra. In other words, all the operations involving the assignment can 
be expressed as well using the operators defined above. However, the assignment 
operator is useful because it allows the specification of expressions in a more 
concise and modular fashion. An assignment has the following form: 

w ^ E{n,r 2 ,... ,r„) (19) 



Table 10. Result of relational-algebra Expression (18) 



course 
Compilers 
Operating Systems 
Robotics 
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where w is an identifier and E a relational expression. In general, a query ex- 
pressed in relational algebra consists of a (possibly empty) sequence of assign- 
ments and a final relational expression Q: 

wi ^ Ei{ri,T 2 , ... ,r„) 

W2 ^ E2{ri,r2, . . . ,rn,wi) 

( 20 ) 

Wm ^ Em{ri,r 2 , ■ . ■ ,rn,Wi,W 2 , ■ . ■ ,Wm-l) 

Q{ri,r 2 ,... ,rn,Wi,W 2 ,... ,Wm) ■ 

Each assignment instantiates a temporary relation Wi whose schema and instance 
are determined by the corresponding expression Ei. However, relations Wi are 
only visible within Q and disappear as soon as the execution of Q terminates. 

Example 8. Consider the problem of determining the age of the youngest teacher. 
Note that relational algebra does not provide any aggregation function like min 
to determine a single value (e.g. maximum, minimum, or average) starting from 
a set of values. Consequently we have to specify the relevant query by means of 
relational operators only. This can be done as follows: 

Ages <— TTage (Teachers) 

NotMin ^ TTage (cTage>agel (Ages X (Pagel<-age(Ages)))) (21) 

Ages — NotMin . 

In the above sequence of expressions, the first two are assignments, while the 
third is the displayed result. Ages is a temporary relation incorporating the whole 
set of ages in Teachers (see Table 2). The other temporary relation NotMin 
represents the set of ages which cannot be the minimum value. This claim is 
supported by the evidence that each age is combined with each other age in Ages 
by means of the product (the renaming is necessary to avoid duplicated attribute 
names in the resulting schema). Then a selection is applied to the result of the 
product: only the ages which are greater than at least another age are selected. 
Consequently, NotMin will embody all the ages but the minimum, this being the 
age which is not greater than any other age. Hence, the final result is obtained 
by simply complementing NotMin with respect to Ages through the difference 
operation. The final result is therefore the singleton {36}, which incorporates 
the age of Patricia, the youngest teacher. □ 

3 Multisets in Relational Database Systems 

The notion of a relation as a set of tuples is a simple and formal model of data. 
Being a set, a relation cannot include duplicates. Accordingly, all the relations 
involved in the examples of Sect. 2 do not incorporate any duplicate tuple. 
However, commercial relational database systems are almost invariably based 
on multisets instead of sets. In other words, tables are in general allowed to 
include duplicate tuples. 
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The main reason for considering relations as multisets in database systems is 
efficiency of query evaluation. Normally, base relations do not include duplicates. 
However, duplicate tuples can be generated within the evaluation of a relational- 
algebra expression which involves either projections or unions. For example, 
when we do a projection, if we want the result to be a set, we are required to 
compare each tuple in the result with all the other tuples in the result in order 
to be sure that tuples are not replicated. Generally speaking, duplicate removal 
is very expensive in time. Instead, if we are allowed to have a multiset as the 
result, then we merely project each tuple and add it to the result, regardless of 
other occurrences of the same tuple. 

Likewise, when making the union of two relations ri and T 2 , if we want the 
result to be a set, we must check that each tuple of one relation is not already 
included in the other. That is, if ni and ri 2 are the number of tuples in r\ and 
C 2 , respectively, we are expected to make ni ■ comparisons. Instead, if we 
accept a multiset as the result, then we just copy all the tuples of ri and r 2 into 
the answer, regardless of whether or not they appear in both relations, thereby 
making ni + U 2 operations without any comparison. 

Sometimes, a hybrid approach is adopted: duplicate removals are delayed as 
much as possible in order to optimize efficiency whilst obtaining a set as the 
result. This means that, from the user point of view, relation are sets, while the 
evaluation system makes the best choices to produce the required result based 
on implementation-oriented criteria. 

Example 9. With reference to Example 7, we may represent Expression (16) as a 
tree where leaves and internal nodes correspond to base relations and operators, 
respectively, as shown in Fig. 1. 

A temporary relation is associated with each internal node, that implicitly 
represents the intermediate result of the expression inherent to the corresponding 
sub-tree. 

Thus, the tree of Fig. 1 incorporates four internal nodes and two leaves, which 
incidentally happen to refer to the same base relation Priorities. Three nodes 
are virtually subjected to duplicate removals, these being the two projections 




Priorities 



Priorities 



Fig. 1. Evaluation tree of Expression (16) 
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Table 11. Multiset result of Expression (16) 



course 

Compilers 

Computers 

Geometry 

Operating Systems 

Programming languages 

Robotics 

Robotics 

Software engineering 
Software engineering 
Software engineering 
Calculus 
Algebra 

Programming Languages 

Computers 

Geometry 

Programming Languages 
Programming Languages 
Databases 



and the union. If we allow the result to be a multiset, we will obtain the result 
displayed in Table 11. 

Note that Table 11 incorporates 18 tuples while the set result of the same 
expression outlined in Table 8 includes 10 tuples only. However, the multiset 
result, although larger, can be computed more quickly. □ 

The instance of a multiset m can be implemented in two different ways: 

1. Extensional representation: different occurrences of the same tuple are phys- 
ically replicated, such as in Table 11. 

2. Intensional representation: the whole collection of occurrences of the same 
tupletis physically implemented by a single tuple (t,w),whereo; = Occ(t,m). 



Example 10. The intensional representation of the multiset displayed in Table 11 
is shown in Table 12. □ 

Note that the intensional representation requires w to be updated whenever 
duplicates may be generated, thereby losing the advantages of the extensional 
representation in terms of computation efficiency. On the other hand, represent- 
ing intensionally several tuples by means of a single tuple allows the database 
system to save space and even time in those operations which do not introduce 
duplicates, as a smaller set of tuples are processed^. 



^ From the physical point of view, a multiset stored intensionally is in fact a set. 
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Table 12. Intensional representation of the multiset displayed in Table 11. 



course 



Compilers 
Computers 
Geometry 
Operating Systems 
Programming languages 
Robotics 

Software engineering 

Calcnlus 

Algebra 

Databases 



1 

2 

2 

1 

4 

2 

3 

1 

1 

1 



3.1 Relational Algebra for Multisets 

The operators of relational algebra can be straightforwardly extended for manip- 
ulating multisets. However, as it will be shown in Sect. 3.3, some of the algebraic 
properties which hold for sets hold no longer for multisets. In what follows, multi- 
sets are enclosed within square brackets. Furthermore, the number of occurrences 
of t within a multiset m is denoted by the function Occ(t, m). For example, if m 
is the multiset displayed in Table 11 and t = (Software engineering), we have 
Occ(t, to) = 3. 

In order to univocally define a multiset, it is not enough to specify its ele- 
ments, it is also necessary to assign a cardinality to each of them. Thus, in what 
follows, the definition of the multiset resulting from each operation is usually 
twofold. Indeed, the specification of the cardinality of each element is a com- 
plete definition as, if the cardinality of an element is zero, then such an element 
will not belong to the multiset. Therefore, in some cases, only the definition of 
the cardinality is given. 



Projection of multisets. Provided that multiple occurrences of the same tu- 
ple can be inserted into a multiset, the definition of projection of multisets is 
formally the same given for sets. Let to be a multiset defined over the schema 
M containing the set X of attributes, and let F C A. The projection of to onto 
Y, TTY{m), is a multiset on the attributes in Y consisting of the restrictions of 
the tuples of to to the attributes in Y : 

7Ty(TO) [t{Y) I t € to] . (22) 

However, in contrast with Formula (2), duplicate restrictions of t on F are not 
removed from the result. Specifically, the number of occurrences is given by the 
following formula: 



Occ(t,7rr(m)) ^ Occ(t',m) . 

t' 



(23) 
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Selection of multisets. The selection operation on a multiset m simply applies 
the selection predicate to each tuple in m and retains in the result those tuples 
t G m for which the predicate evaluates to true: 



= [t Gm\ p{t)] . ( 24 ) 

Note that the Boolean value p{t) is the same for all the occurrences of t in to. 
Consequently, depending on the value of p(t), either all the occurrences of t are 
selected or none or them, that is: 



Occ(t, (Tp(to)) = 



Occ{t,m) if p{t) 

0 otherwise . 



( 25 ) 



This is due to the inability to distinguish different occurrences of t within to 
based on p. Furthermore, as for sets, the selection operator does not generate 
new duplicates. However, it preserves the selected duplicates in the result. 



Product of multisets. The product operation can be extended for multisets 
in a natural way. Let toi and to- 2 be two multisets defined over schemas X and 
F, respectively, such that X HY = 0. The product TOi x to -2 is a multiset over 
X UY consisting of all the tuples resulting from the concatenation of tuples in 
TOi with tuples in m2: 



TOi X TO2 = [t over XUY \ {3ti){3t2){ti Gmi,t 2 G TO2,t(x) = = ^2)] • 

( 26 ) 

In other words, each tuple of toi is paired with each tuple of m2, regardless 
of whether it is a duplicate or not. As a result, if tuple ti appears rii times 
in TOi and tuple ^2 appears ri2 times in m2, then, denoting with ti 0 ^2 the 
concatenation of tuples ti and t 2 , in the product toi x to- 2, the tuple ti 0 t 2 will 
appear rii • U2 times, that is: 

Occ(ti 0 ^2, TOl X TO2) Occ(ti, TO-i) • Occ(t2, W2) . ( 27 ) 



Joiu of multisets. Both natural join and theta-join can be extended for mul- 
tisets. Let TO-i and m2 be two multisets defined over disjoint schemas X and Z, 
respectively. Then, the theta-join of toi and to- 2 is defined as follows: 

TO-i Np TO2 =*' [t over XUZ \ ( 3 ti)( 3 t 2 ) {tiGri,t2Gr2,t(^x)=ti,t(Y)=t2, p(t))] . 

( 28 ) 

The equivalence expressed in Equation ( 10 ) is required to hold for multisets as 
well, that is: 



TOi Mp TO-2 = CTp(TOi X m 2 ) ■ 



( 29 ) 
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Therefore, based on the above equivalence, we can derive the number of occur- 
rences of tuples in the theta-join result based on Equations (27) and (25) as 
follows: 



Occ(ti 0 ^2, TO2) = Occ(ti 0 ^2, CTp(mi X TO2)) 



_ r Occ(ti 0 t2,TOl X m 2 ) ifp(ti0t2) 

( 0 otherwise 

^ f Occ(ti,mi) • Occ(t 2 ,W 2 ) ifp(ti 0 t 2 ) /on', 
( 0 otherwise . 

The natural join for multisets can be defined as follows. Let mi and m 2 be 
two multisets defined over the set of attributes XY and YZ, respectively, where 
XY n YZ = V. The natural join mi ixi m 2 , is a multiset over XYZ = XY U YZ 
consisting of all the tuples resulting from the concatenation of tuples in mi with 
tuples in m 2 that have identical values for the attributes V : 

def 

mi M m 2 = [t over XYZ | (3ti)(3t2) (ti € mi,t 2 € m 2 ,t(xY) = fi,i(YZ) = ^2 ] ■ 

(31) 

We require the multiset version of Equivalence (14) to hold, that is: 

mi Mm 2 = 7T„,(miMp^^^ (32) 

where is the predicate defined in Formula (15). Then, we make use of 

Formula (32) to compute the number of occurrences of tuples in the result. Let 

^2 = P4P...,A^^Ai,...,A,(to2) . (33) 

Then, based on Formulae (23) and (30), the number of occurrences of tuples in 
the natural join result can be derived as follows: 

Occ(t, mi M m 2 ) Occ(t, 7r„2(mi ^ 2 )) = 



Occ(t',mi m' 2 ) = 



E 

™2)4(XYZ)~*4(XY)~*l’*(y/Z )— *2 

E 



Occ(tj,mi) • Occ(< 2 ,TO 2 ) if py y,(t') ^ 
0 otherwise 

Occ(t'i, mi) • Occ(t 2 , W 2 ) (34) 



*(xyz)“*’*(>;y)“*i 4(1" >*2 )“*(!") 



where Y' Z = Y'UZ. 
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Renaming of multisets. From the instance point of view, the renaming opera- 
tion is an identity function. In other words, the result differs from the operand in 
the schema whilst retaining the instance. As such, it can be applied to multisets 
based on the same semantics defined for relations. In particular, given a multiset 
m, the number of occurrences of each tuple in the result does not change, that 
is: 

Occ(t,pA;,A^,...,A;^AiM2....M;,M) =‘" Occ(t,m) . ( 35 ) 

Set-theoretic operators for multisets. The set-theoretic operations defined 
for relations in Sect. 2 . 1 , namely union, intersection, and difference, can be ex- 
tended to multisets in a natural fashion. Let toi and m2 be two multisets defined 
over the same schema. The union of mi and m2 is defined as follows: 

mi U m2 [t I t G mi V t G m2 ] . ( 36 ) 

However, the peculiarity of the union definition lies in the fact that the resulting 
multiset is expected to incorporate all the occurrences of tuples, both in mi and 
m2, that is: 



Occ(t,miUm2) = Occ(t, mi) -I- Occ(t, m2) . ( 37 ) 

The intersection operation mi Hm2 will include a number of occurrences, of each 
shared tuple, equal to the minimum number of occurrences of the tuple in mi 
and m2, respectively, that is: 

Hpf 

Occ{t,mi n m2) = min(Occ(t, mi), Occ(t, m2)) . ( 38 ) 

In particular, if the tuple t is included in mi but not in m2, it follows that, 
min(Occ(t, mi), Occ(t, m2)) = 0 , whatever Occ(t,mi). That is, as expected, t 
will not be part of the intersection. On the other hand, if t is both included 
in mi and m2, the result will include the smallest multiset of t, that is, the 
minimum number of occurrences of t in mi and m2. 

The difference operation mi — m2 will include a number of occurrences of each 
tuple t which depends on whether or not Occ(t, mi) > Occ(t,m2). Specifically, 
if this relationship is met, then the number of occurrences in the result will be 
the difference Occ(t,mi) — Occ(t,m2), otherwise the result will not include t. 
Formally, 



Occ(t,mi— m2) = max((Occ(t,mi) — Occ(t, m 2 )), 0 ) . ( 39 ) 



Duplicate removal. Given a multiset m, we may introduce a special unary 
operator called duplicate removal, denoted by 5 {m), which generates a multiset 
obtained by removing from m duplicate occurrences of tuples. Formally, let 
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be a partition of m where each part T{ti), i G [1 ..k], is the multiset of occur- 
rences of tuple ti. Then, 

S{m) ■■■ ,tk] ■ (41) 

Clearly, 

Occ(t, 5(m)) 1 . (42) 

3.2 Multiset Operations on Relations 

Intuitively, each relation r can be viewed as a special multiset m in which each 
tuple happens to occur just once within the multiset, that is 

(Vt G to) (Occ(t,TO-) = 1) . (43) 

We say that to is the multiset mirror of r, denoted by 9Jt(r). There exists an 
isomorphism between the tuples of r and to. More precisely, r and to incorporate 
the same tuples. Consequently, we may apply to r both the operators of relational 
algebra for relations and those extended for multisets, as defined in Sect. 3.1. 
This is due to the fact that there exists an isomorphism between the operators for 
relations and those for multisets, that is, for each operator of relational algebra 
there exists a homonymous operator of the extended algebra for multisets and 
vice versa. In other words, the algebraic operators are polymorphic in nature, 
as they can be applied either to relations or multisets, even though, generally 
speaking, with different semantics. 

Let r, Ti, and r 2 denote relations. Let uj denote an (either unary or binary) 
generic (polymorphic) algebraic operator. Then, it is worthwhile finding out 
which operators meet the following equivalences: 

9Jt(w(r)) = w(9Jt(r)) , . 

lH(ri UJ T 2 ) = 9H(ri) uj ®l(r 2 ) ' 

If Equivalence (44) holds for an operator uj, it means that considering the 
operand as either a relation or a multiset is the same, that is, it leads to the 
same result. 

You can verify that all of the unary operators but projection meet the first 
equivalence in (44), that is: 

(Vw G {cr,p}) (9Jl(w(r)) = w(9Il(r))) . (45) 

The discrepancy of projection stems from the possible duplicate generation in 
multisets, which are instead removed in relations. Likewise, all of the binary 
operators but union meet the second equivalence in (44), that is: 

(Vw G { X , M, Mp, n, -}) (9Jl(ri UJ r 2 ) = 97l(ri) uj M{r 2 )) ■ (46) 

To understand why union does not fulfill the equivalence, let us assume the case 
in which ri and r 2 contain the same tuple t. If so, the union of relations, ri Ur 2 , 
will contain just one instance of t, as the duplicate tuple is removed from the 
result. By contrast, the union of the corresponding multiset mirrors will embody 
two occurrences of t. 
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3.3 Algebraic Laws for Multisets 

An algebraic law is an equivalence between two expressions of relational algebra. 
Each expression involves a number of variables denoting relations. The equiva- 
lence establishes that no matter what relations we replace for these variables, the 
two expressions give rise to the same result. However, an equivalence law which 
is valid for relations may happen not to hold when variables are interpreted as 
multisets. 

A list of equivalence laws which are both valid for relations and multisets is 
given below, where x, y, and z denote collections (either relations or multisets) 
variables, A a subset of the attributes of the relevant projection operand, and 
pi a generic predicate on the tuples of the selection operand: 
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By contrast, the following algebraic laws hold for relations but not for mul- 
tisets: 



(x n y) - z = 


1 

c 


(56) 


X n (y U z) = 


(x n y) u (x n z) 


(57) 


^PlVp2(x) = 


CTpi (x) U CTp, (x) 


(58) 



3.4 SQL and Multisets 

Relational algebra provides a concise notation for defining queries. However, 
commercial database systems usually adopt a query language that is more user- 
friendly and even more powerful than relational algebra. The most commonly 
used concrete query language is SQL [13, 19, 37]. As a matter of fact, SQL has 
established itself as the standard relational-database language, which is not only 
a query language but also incorporates several other constructs, such as updating 
the database and defining security constraints. 

Several different dialects (versions) of SQL exist. On the one hand, there are 
standards, among which are ANSI SQL and an updated standard called SQL2. 
There is also an emerging standard called SQL3 that extends SQL2 with several 
new capabilities, including recursion, triggers, and objects. On the other hand, 
there are many versions of SQL produced by the principal vendors of database 
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management systems, which usually conform completely to ANSI SQL, but also, 
to a large extent, to SQL2. To a lesser extent, some of them, include some of the 
advanced capabilities of SQL3. 

The data model of SQL is relational in nature, as well as the relevant oper- 
ations. However, unlike relational algebra, the tables manipulated by SQL are 
not relations, but, rather, multisets. The reason for this peculiarity is twofold. 
First, as mentioned in Sect. 2.1, this is due to a practical reason: since SQL ta- 
bles may be very large, duplicate elimination might become a bottleneck for the 
computation of the query result. Second, SQL extends the set of query operators 
by means of aggregate functions, whose operands are in general required to be 
multisets of values, as illustrated in the sequel. 



SQL query paradigm. In its simplest form, the typical SQL query is expressed 
by the so-called select-from-where paradigm, generically defined as follows: 

SELECT Ai, . . . , A^k 

FROM Ml,... ,M„ (59) 

WHERE p 

where each Mj, i G [l-.n], is a table (multiset of tuples) name, each Aj, j G 
[1 .. fc], is an attribute name, and p a predicate. In terms of algebraic operators 
for multisets, the above query can be interpreted as follows: 

X ■■■ X M„)) . (60) 

Therefore, the evaluation of the select-from-where statement is based on the 
following steps: 

1. The tables listed in the FROM clause are combined through the product; 

2. The corresponding result is filtered by the selection predicate p specified in 
the WHERE clause; 

3. The filtered tuples are projected on the list of attributes specified in the 
SELECT clause. 



Example 11. With reference to the multiset mirrors of relations Courses and 
Teachers shown in Tables 1 and 2, respectively, we may display the name and 
the relevant department of those courses whose teachers are older than 40 as 
follows: 

SELECT course , department 

FROM Courses, Teachers 

WHERE teacher = name AND age > 40 



which yields the result outlined in Table 13. Incidentally, no duplicate tuples are 
generated. □ 
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Table 13. Result of SQL query of Example 11 



course 


department 


Calculus 


Mathematics 


Compilers 


Computer science 


Geometry 


Mathematics 


Operating systems 


Computer science 


Programming languages 


Computer science 


Robotics 


Electronics 


Software Engineering 


Computer science 



Example 12. With reference to the multiset mirrors of relations Courses, 
Teachers, and Priorities, displayed in Tables 1, 2, and 3, respectively, we may 
find out the courses which depend on some prerequisite course whose teacher is 
in the Computer science department as follows: 

SELECT course 

FROM Priorities, Courses, Teachers 
WHERE prerequisite = course AND 

teacher = name AND department = ’Computer science’ 

which displays the result shown in Table 14. Note that, in contrast with the 
SQL query of Example 11, Table 14 is a multiset where the course Software 
engineering is replicated. □ 



Duplicate removal in SQL. In order for a result of a select-from-where state- 
ment not to include duplicate tuples, we are required to follow the keyword 
SELECT by the keyword DISTINCT, which tells SQL to produce only one occur- 
rence of any tuple. Consequently, the result is guaranteed to be duplicate-free. In 
terms of the algebraic operators for multisets defined in Sect. 3.1, the set-oriented 
select-from-where paradigm 

SELECT DISTINCT Ai, . . . ,Au 

FROM Ml,... ,M„ (61) 

WHERE p 



Table 14. Result of SQL query of Example 12 



course 
Compilers 
Operating Systems 
Robotics 

Software engineering 
Software engineering 
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can be interpreted by prefixing Expression (60) with the duplicate removal op- 
erator as follows: 



X ••• X M„))) . (62) 

In order to provide SQL queries with a set-oriented semantics, one might be 
tempted to place DISTINCT after every SELECT, on the assumption that it is 
harmless. On the contrary, it might be very expensive to perform duplicate 
removal on a table. Generally speaking, the table must be sorted so that the 
different occurrences of the same tuple appear next to each other. In fact, only 
by grouping the tuples in this way can we determine whether or not a given 
tuple should be eliminated. It is very likely that the time needed to sort the 
table is a good deal greater than the time it takes to execute the query itself. 
Consequently, the keyword DISTINCT should be used carefully. 

Set-theoretic operations in SQL. SQL provides the usual set-theoretic op- 
erators, which can be applied to the results of queries, provided these queries 
produce tables with the same schema. Union, intersection, and difference op- 
erators are denoted in SQL by the keywords UNION, INTERSECT, and EXCEPT, 
respectively. However, unlike the SELECT statement, which preserves duplicates 
as a default and only eliminates them when instructed to by the DISTINCT 
keyword, the set-theoretic operators eliminate duplicates by default. We may 
change such a default by following UNION, INTERSECT, or EXCEPT, by the key- 
word ALL, thereby enabling the multiset semantics of such operators, as detailed 
in Sect. 3.1. 

Example 13. We may rephrase in SQL Expression (16) of Example 7, which 
retrieves the courses which either depend on some prerequisite courses or are 
themselves prerequisites for other courses, as follows (the keyword AS allows 
us to change the name prerequisite into course, that is, to implement the 
renaming) : 

(SELECT course FROM Priorities) 

UNION 

(SELECT prerequisite AS course FROM Priorities) 
obtaining the same result displayed in Table 8. If, instead we write 

(SELECT course FROM Priorities) 

UNION ALL 

(SELECT prerequisite AS course FROM Priorities) 

our result will preserve the duplicate tuples, as outlined in Table 15. 

Compared with Table 8, the result in Table 13 includes duplicates for courses 
Robotics, Software engineering. Programming languages. Computers, and 
Geometry. However, apart from duplicates. Table 13 contains the same infor- 
mation of Table 8, even if with some redundancy. By contrast, it is worthwhile 
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Table 15. Result of multiset interpretation of Expression (16) 



course 

Compilers 

Computers 

Geometry 

Operating Systems 

Programming languages 

Robotics 

Robotics 

Software engineering 
Software engineering 
Software engineering 
Calculus 
Algebra 

Programming languages 

Computers 

Geometry 

Programming languages 
Programming languages 
Databases 



highlighting that the multiset counterpart of Expression ( 18) is inconsistent with 
the given query, that asks for the courses which require some other courses but 
are not prerequisite of any course: 

(SELECT course FROM Priorities) 

EXCEPT ALL 

(SELECT prerequisite AS course FROM Priorities) 

Based on the multiset version of the difference operation defined in Sect. 3.1, 
the above SQL query gives rise to the result displayed in Table 16. Comparing 
Table 16 with Table 10, we note two discrepancies. First, in Table 16, Robotics is 
replicated, due to the projection within the first operand of the EXCEPT operator 
onto course. This is however only a redundancy. Second, Table 16 embodies the 
course Software engineering, which is instead not included in Table 10. Such 
a ‘spurious’ tuple stems from the semantics of difference operation for multisets. 



Table 16. Result of multiset interpretation of Expression (18) 



course 

Compilers 
Operating Systems 
Robotics 
Robotics 

Software engineering 
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in which, according to Formula (39), the number of occurrences of each tuple t 
= “Software engineering” in the result is 

Occ(t, mi — m 2 ) = max((Occ(t, mi) — Occ{t, m 2 )), 0) = max((2 — 1), 0) = 1 . 

(63) 

However, we cannot obtain the required result by simply removing the ALL qual- 
ifier from EXCEPT, thereby writing the following SQL query: 

(SELECT course FROM Priorities) 

EXCEPT 

(SELECT prerequisite AS course FROM Priorities) 

In fact, as duplicate removal is activated after the difference operation, the above 
statement will simply remove the duplicate course Robotics, preserving however 
the spurious tuple of Software engineering. To implement the correct query, 
we should also eliminate duplicates from the first operand as follows: 

(SELECT DISTINCT course FROM Priorities) 

EXCEPT 

(SELECT prerequisite AS course FROM Priorities) 
which yields the expected result displayed in Table 10. □ 

Aggregate operations in SQL. Among other capabilities, SQL provides ag- 
gregate functions. An aggregate function takes a collection of values as input, 
which is in general a multiset, and returns a single value. Let a be the multiset 
of values of an attribute A of a table m, and IF an aggregate function. Then, 

T{a) (64) 

is expected to return a scalar value based on a. SQL provides the following 
aggregate functions: 

1. SUM, the sum of the values of a; 

2. AVG, the average of values of a; 

3. MIN, the least value in a; 

4. MAX, the greatest value in a; 

5. COUNT, the number of values in a. 

Virtually, the computation of lF(a) is performed by projecting a table m on an 
attribute A and by applying T to the resulting multiset a of A-values. Accord- 
ingly, from the syntactical point of view, in SQL the argument of the aggregate 
function is the identifier of an attribute A belonging to a table listed in the FROM 
clause. 

Example 14- With reference to the multiset mirror of relation Teachers shown 
in Table 2, we may compute the average age of the teachers as follows: 
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SELECT AVG(age) 

FROM Teachers 

which gives rise to the value 45. Instead, if the average function would be applied 
to a set of values, we would obtain a result which is slightly greater than the 
actual average, owing to the elimination of one of the two occurrences of value 
42, which is associated with both Angelique and Martin. □ 

Not all of the aggregate functions are sensitive to duplicate elimination of ele- 
ments within the operand. Specifically, both the least and the greatest values 
within a collection a are preserved after duplication removal. We can express 
this property by means of the S operator introduced in Sect. 3.1 as follows: 

{WT e {MIN, MAX}) (T{a) = T{S{a))) . (65) 



Example 15. To determine the age of the youngest teacher in Table 2, we may 
rephrase the relational-algebra sequence of statements (21) of Example 8 by 
means of the following SQL query: 

SELECT MIN (age) 

FROM Teachers 

which is expected to yield the value 36, relevant to Patricia. Note that the SQL 
version of the query is much simpler than its relational-algebra counterpart^. 
According to Equivalence (65), the same result would be obtained by prefixing 
age with the keyword DISTINCT. □ 

By contrast, the other SQL aggregate functions return in general different results, 
depending on whether duplicates are removed or not, that is: 

(VJP G {SUM, AVG, COUNT}) (E{a) ^ )^(i5(a))) • (66) 

To force duplicate removal, the attribute identifier must be preceded by the 
keyword DISTINCT. 

Example 16. With reference to the multiset mirror of relation Teachers shown 
in Table 2, we may compute the number of departments as follows: 

SELECT COUNT (DISTINCT department) 

FROM Teachers 

which produces the value 3. Note that the qualification DISTINCT is essential to 
tell SQL to discard duplicates before computing the number of departments. □ 

® Among the SQL aggregate functions, only MIN and MAX can be implemented in re- 
lational algebra. Thus, the expressive power of SQL is greater than the expressive 
power of relational algebra. 
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Table 17. Result of SQL query of Example 17 



department 


numc 


Computer science 


5 


Mathematics 


3 


Electronics 


2 



SQL allows aggregate functions to be applied to parts of a table, which are 
collected based on a grouping criterion, expressed by means of the GROUP BY 
clause. More precisely, the multiset of tuples of a table m is partitioned based 
on a set G of attributes, thereby obtaining a partition of m where each part is 
composed of all the tuples which share the same values for attributes in G. The 
aggregate function will be applied to each of these parts and, as a result, a set 
of values, one for each part, will be computed. 

Example 17. With reference to the multiset mirrors of relations Courses and 
Teachers shown in Table 1 and Table 2, respectively, we may compute the 
number of courses relevant to each department as follows: 

SELECT department, COUNT (course) AS numc 
FROM Courses, Teachers 
WHERE teacher = name 
GROUP BY department 

which yields the result displayed in Table 17. The special clause GROUP BY tells 
SQL to make a partition of the join of Courses and Teachers before applying 
the COUNT aggregate function to each part, that is, to each set of tuples relevant 
to each department. Accordingly, the GROUP BY clause is applied after the WHERE 
clause and before COUNT. □ 

Even in the case of grouping, SQL aggregate functions are applied, by default, 
to multisets of values. The qualifier DISTINCT is used to force duplicate removal 
as usual. 

Subqueries and test for absence of duplicates. SQL allows the WHERE 
clause to include SQL subqueries, whose result may represent the operand of a 
Boolean function included in the relevant predicate. For example, we may test for 
the membership of a value within a collection resulting from an SQL subquery 
by means of the IN Boolean function. 

Example 18. Consider the multiset mirror of relation priorities in Table 3 and 
the problem of determining the courses which have some prerequisites but are 
not a prerequisite for any courses, which was already formulated in relational- 
algebra Expression (18). We may rephrase such a query in SQL either using the 
EXCEPT operator or by means of a selection in which the WHERE clause involves 
an SQL subquery as follows: 
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SELECT course 
FROM Courses 

WHERE course IN (SELECT course 

FROM Priorities) AND 
course NOT IN (SELECT prerequisite 
FROM priorities) 

Each tuple in Courses is selected if and only if the relevant course appears 
within the set of courses in Priorities but not in the set of prerequisite courses. 
□ 

Example 19. To show how SQL allows us to test the absence of duplicates in a 
subquery, consider the multiset mirror of relation Courses displayed in Table 1. 
We may find out the courses who are taught by a teacher who teaches just that 
course as follows: 

SELECT course 
FROM Courses 

WHERE UNIQUE (SELECT course 

FROM Courses AS C 

WHERE C. teacher = Courses . teacher 

Note the use of the qualifier AS in the FROM clause, that renames Courses to 
C within the scope of the subquery. Besides, the pathname notation allows us 
to access tuples either in Courses or C. Thus, for each tuple of Courses, the 
predicate of the WHERE clause is evaluated by selecting on a copy of Courses, 
named C, the tuples matching the teacher name. Specifically, the result of each 
subquery returns the multiset of courses which are taught by the current teacher 
in the external query. Based on the UNIQUE Boolean function, the teacher is 
selected if and only if no duplicates are incorporated in the subquery, that is, 
if and only if the teacher teaches just one course, as required. Note that, if we 
substitute UNIQUE by NOT UNIQUE, we will retrieve the courses which are taught 
by a teacher who teaches several courses, that is, the complement of the previous 
query (the new predicate is in fact the logical negation of the previous one). □ 

Database modifications in SQL. Besides query-oriented language capabili- 
ties, SQL provides a means of changing the state of the database, specifically, 
by inserting tuples into a relation, by deleting certain tuples from a relation, or 
by updating the values of certain attributes in certain tuples. 

Example 20. With reference to the multiset mirrors of relation Priorities 
shown in Table 3, we may insert the prerequisite Algebra for Robotics as fol- 
lows: 

INSERT INTO Priorities 
VALUES (’Robotics’, ’Algebra’) 
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Conversely, we may remove all the prerequisites for Robotics as follows: 

DELETE FROM Priorities 
WHERE course = ’Robotics’ 

Finally, with reference to Table 1, we may move the courses of Angelique to one 
year upward as follows: 

UPDATE Courses 

SET year = year +1 

WHERE teacher = ’Angelique’ 



□ 

There are some remarkable points about how insertions and deletions interact 
with the duplicates included in SQL tables. On the one hand, an insertion adds 
the specified tuple irrespective of whether it is already included in the table. On 
the other, the deletion statement, which appears to describe a single tuple to 
be deleted, in fact removes in general several tuples, because there is no way in 
SQL to remove one occurrence only from a collection of identical tuples. This 
characteristics is bound to surprising effects: given a table m containing a tuple 
t, if we followed the insertion of another occurrence of t into m by the deletion of 
t, both occurrences of t would be removed from m. In other words, the insertion 
of t followed by the removal of t would leave the database in a different state 
from what it was before the two operations. 

4 Nested Relational Model 

The success of relational database systems was mainly due to the fact that they 
provide a satisfactory response to the typical needs of business applications, for 
which the idea of databases as large collections of persistent data to be handled 
in an effective, efficient, and reliable way was conceived. 

The most successful features of the relational model are the ease of use of its 
query language, which is set-oriented in nature, compared with the procedural, 
navigational style of earlier proposals, together with the conceptual simplicity 
of the data model. In fact, the relational model is actually based on a single 
data structure, the relation. Each relation contains its own data, and connections 
between data of different relations are implicitly represented by means of equality 
of values. For this reason, as already remarked, the relational model is often 
qualified as value-oriented. 

Business applications usually involve large amounts of data with a relatively 
simple structure. The relational model provides an effective and implementation- 
independent way of specifying this structure while allowing at the same time 
flexible and sophisticated querying capabilities through set-oriented operations 
that act on whole relations rather than a single tuple at a time. For all these 
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reasons the relational model proved itself satisfactory with respect to the require- 
ments of business applications, improving notably the productivity of software 
development in this area. 

This success stimulated the adoption of the database technology in areas 
different from business applications, such as computer-aided design, computer- 
aided software engineering, knowledge representation, office systems, and multi- 
media systems. New applications, however, highlighted a number of shortcomings 
inherent to the relational database technology, among which are the following: 

1. The involved data have a complex structure that cannot be expressed in a 
natural way in the relational model; 

2. The relationships among data that derive from their semantics are very 
complex and cannot be efficiently stored in a value-oriented way; 

3. Relational languages lack expressive power for most applications outside the 
business area. 

The first step that was devised in the direction of widening the range of appli- 
cability of database systems was to extend the relational data model. This idea 
can quite naturally be understood starting from the consideration that two out 
of the above three limitations of relational systems arise from the simplicity of 
their data model. It looked reasonable to extend the data model, without losing 
its positive features, in order to explicitly represent data structures more com- 
plex than flat tuples of values. This would have also solved the second problem, 
that is, efficiently storing related data. As to the solution of the third prob- 
lem, relational languages should have been extended in order to cope with more 
complex data structures, while retaining their set-oriented, declarative style: this 
extension, in the original idea, should have supplied the lacking expressive power. 

Perhaps, the most noteworthy extension of the relational model was the 
nested relational model, according to which the assumption of atomic attributes 
(flat relations) is relaxed. Such an assumption, which excludes the possibility 
that an attribute value be a collection of other values, is called First Normal 
Form (INF). 

The standard relational model that derived implicitly from this assumption 
is therefore a flat relational model. Then, a relational database schema consists 
of relation schemas of the form: 

R{A\ : D\, . . . , An : Dn) (67) 

where each Di is an atomic domain. The easiest way of defining a data model that 
allows for the representation of complex data structures is the direct extnsion of 
the relational model obtained by relaxing the INF assumption. A nested relation 
is defined in terms of (possibly complex) attributes. A complex attribute is in 
turn a (possibly nested) relation. Nested relations can be manipulated by means 
of special purpose languages [30, 6, 22, 1, 18, 8, 3, 43, 14, 10], among which are 
various extensions of relational algebra. In the nested data model, also called Non 
First Normal Form (^ INF), attribute values can be nested relations themselves, 
with unbounded depth. A nested relation schema R could be expressed as follows: 

R{Ai : Tu . . . , An : Tn) (68) 
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where each Tj is either an atomic domain D or a, nested relation schema of the 
form: 

{Aj^ : Tj^, . . . ,Aj^^ : Tj_^_) . ( 69 ) 

In the following, when irrelevant, atomic domains will be omitted. 

As the notation suggests, the notion of a nested relation is the natural ex- 
tension of the notion of a flat relation. A nested relation is a set of nested tuples 
in just the same way as a flat relation is a collection of flat tuples. A nested 
tuple associates a value from the corresponding domain with each attribute in 
the schema, as in the flat case. A flat tuple is a particular case of nested tuple, 
in which all the attributes are associated with an atomic domain. The value 
associated with an attribute is atomic if the attribute is simple, otherwise it is 
a nested relation. In the latter case the attribute is complex. 

Even if the definition of nested relations is recursive, the schema is supposed 
to have a finite depth, because at each level every complex attribute must corre- 
spond to a new relation schema: it cannot be associated with a relation schema 
of the upper levels. Cyclical paths in the schemas are not allowed: the schema of 
a nested relation can therefore be conveniently represented by a tree, where the 
root is the name of the external nested relation, simple attributes are represented 
as leaves, and complex attributes correspond to internal nodes. 

Nested relations and complex attributes have exactly the same properties as 
far as their structure is involved: we make use of the same definition for both of 
them. However, a nested relation is an instance consisting in a single set of nested 
tuples. By contrast, there are as many instances of complex attributes as tuples 
in its parent relation. Therefore, if the (external) nested relation instance consists 
of n tuples, a complex attribute corresponds to n sets of tuples. This asymmetry 
notably complicates the query language, even if it allows the expression of most 
queries more concisely. 

Another consequence of this extension is that it gives a more complex seman- 
tics to elementary operations on tuples and attributes. For example, in the flat 
case, comparing two attribute values involves only a comparison of two atomic 
values, while in the nested model it requires a more complex comparison between 
sets when the attributes are complex. 

Moreover, new operators are required for attribute comparison, such as in 
selection predicates. Besides the classical comparison operators, such as =, >, 
and >, it is necessary to introduce a number of relational operators for sets, such 
as D (inclusion) . Other set-oriented operations like U (union) and n (intersection) 
need also be included in the language, not only for relations, as in the flat case, 
but also for attributes. 

Formally, an extended relational database schema S' is a collection of rules. 
Each rule has the form 

i?= (Si,... ,S„) (70) 

where R, Ri, . . . , Rn, which are called names, are distinct and there is no order- 
ing on Si, . . . , Rn- The names on the right-hand side of the rule S form a set, 
which is denoted by Sr. Each rule has a different name on the left-hand side. 
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A name is a higher- order name if it occurs on the left-hand side of some rule; 
otherwise, it is zero order, or attribute name. Higher order names correspond to 
nested relations or attributes with a nested structure, while zero order names are 
ordinary, atomic attributes. Rules in the database schema associate each name 
with its structure. Since the structure is expected to have a finite depth, the 
structure of rules cannot be cyclic: this requirement will be specified below. 

A name is external if it occurs only on the left-hand side of some rule; oth- 
erwise, it is internal. External names correspond to complex relations of the 
database, while internal names are attributes, either simple or complex. 

Given an external name R in an extended relational database schema S, 
consider the smallest subset S' of S including: 

1. The rule with R on the left-hand side; 

2. For each higher-order name Rk on the right-hand side of some rule in S' , 

the rule with R^ on the left-hand side. 

S' is called the (nested) relation schema corresponding to R. The set S' corre- 
sponds to the rules in S that are accessible from R. We will normally identify a 
relation schema by its external name R rather than by listing explicitly the set 
S' of rules. 

Given a relation schema R, we can define a unique schema tree of R, written 
Gr. Thus the schema tree of a relation schema S' with external name i? is a tree 
rooted in R. The internal nodes of the tree will be the left-hand sides of other 
rules in S' , and the leaves of the tree will be zero-order objects in the rules of 
S' , representing basic attributes. Each non- leaf node of a relation schema tree 
represents a collection of tuples, each composed of the children of the node. The 
nodes of Gr are exactly the names in the rule R. Gr contains a directed edge 
from R to R' if and only if R' G Er. 

An extended relational database schema S consists of one or several relation 
schemas, possibly sharing some attributes. A unique schema graph of S, written 
Gs, can be defined, which is a graph resulting from the merging of the schema 
trees of all the relation schemas included in S. 

An extended relational database schema S is valid if and only if Gs is a 
directed acyclic graph: this implies that the hierarchical structure of a nested 
relation has an unlimited but finite depth. 

Two valid nested relation database schemas are equivalent if their schema 
graphs are isomorphic, in other words, if the graphs are equal up to renaming of 
internal (non-leaf) nodes. 

Example 21. Gonsidering the relations Courses, Teachers, and Priorities dis- 
played in Tables 1, 2, and 3, respectively, we may define an extended database 
schema S as follows: 



Departments 

teachers 

courses 

Priors 

prerequisites 



(department, teachers) 
(name, age, courses) 
(course, year) 

(course, prerequisites) 
(prerequisite) 



( 71 ) 
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Departments 



Priors 




prerequisites 

prerequisite 



Fig. 2. Schema graph Gs of the extended database schema defined by Rules (71) 



According to our definitions, this schema embodies two external names only, 
namely Departments and Priors. Higher-order names include Departments, 
teachers, courses, Priors, and prerequisites. Zero-order names are name, 
age, course, department, year, and prerequisite. The relevant schema graph 
Gs, which happens to be valid, is shown in Fig. 2. □ 

Having defined an extended relational database schema, we now turn to the 
problem of defining an instance of a complex relation. We want to define in- 
stances so that they are independent of the order of columns. We have simply 
to associate with each atomic attribute a single value taken from an appropriate 
domain, and to each higher-order name a set of tuples of the appropriate type. 
Since the ordering of names is immaterial, each value must be labeled by its 
name. Like for the fiat relational model, we assume the existence of a function 
Dom that associates with each zero-order name its domain. 

An instance of a name R, denoted by r, is an ordered pair (A, Vr), where 
Vr is a value for name R. If i? is a zero order name, a value is an element of 
Dom(i?). If i? is a higher-order name, a value is a set {t} of tuples t, where t 
contains a component {Ri, VrJ for each R^ G Ar. 

Note that two different complex relations can share part of the schema, that 
is, they can have some inner attribute with the same name and schema; however, 
they cannot share part of the instance: every relation is completely independent 
of any other relation. That is, nested tuples cannot share common sub-objects, 
and every update on a tuple or relation is confined to the tuple or relation. 

The schema and instance associated with the same external name R form a 
structure. A structure is therefore a pair (i?, r), where R is an external name and 
r an instance of R. A database structure {S, s) is a database schema together 
with an instance for its external names. 

Example 22. With reference to Example 21, instances of the nested relations 
Departments and Priors are shown in Tables 18 and 19, respectively. 

The instance of the nested relation Departments is composed of three tuples, 
each of which refers to a specific department. Associated with each department 
is a set of teachers, each of which is characterized by a name, an age, and a set 
of courses. A course is described by a name and the year in which it is taught. 
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Table 18. Instance of the nested relation Departments 



department 


name 


teachers 0 

age courses 0 

course 


year 





Computer science 


Angelique 


42 


Operating systems 


4 










Software engineering 


4 






Carol 


38 


Databases 


3 1 




Jerri 


54 


Compilers 


5 










Programming languages 


3 




Electronics 


Patricia 


36 


Computers 


2 1 




Richard 


45 


Robotics 


5 1 


Mathematics 


Gregory 


63 


Calculus 


1 1 




Martin 


42 


Geometry 


2 1 




Sheila 


40 


Algebra 


1 1 



As such, the instance of Departments incorporates in a hierarchical fashion all 
the information contained in Tables 1 and 2. Similar considerations apply to 
Table 19, which represents the instance of the nested relation Priors. In fact, 
each course is associated with all its preceding courses by means of the complex 
attribute prerequisites. □ 

4.1 Nested Relational Algebra 

A meaningful amount of work on the extended relational model was devoted 
to the definition of query languages. Almost all paradigms of languages for the 
relational model have been extended to the nested case. The advantage of the 
extended relational model is here particularly evident: in principle it is not nec- 
essary to design new query languages, all is needed is the freedom to apply old 
constructs of relational query languages to more complex data structures. More 
formally, we need a language design that is fully orthogonal: since relations can 
now occur not only at the outermost level as external operands but even as com- 
plex attributes, the same operators should be applicable at the attribute level 
as well. 

Due to the increased complexity of the data model, the algebraic approach 
has become more popular for the nested model than it was for the classical 
flat one, imposing itself as the predominant stream of research on languages 
for nested relations. An operational approach (as opposed to the declarative 
approach of the calculus-based and rule-based languages) is more appropriate, for 
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Table 19. Instance of the nested relation Priors 



course 



prerequisites O 
prerequisite 



Compilers 

Computers 

Geometry 

Operating systems 

Programming languages 

Robotics 

Software engineering 



Software engineering 



Calculus 



Algebra 



Programming languages 



Computers 



Geometry 

Programming languages 



Programming languages 
Databases 



example, for specifying the schema-restructuring operations that are particularly 
relevant in the extended relational model. 

This is accomplished through the introduction of two new algebraic operators, 
namely nest and unnest, defined later in this section, that allow creation and 
deletion of complex attributes starting from atomic domains. 

Actually, the first proposals inherent to nested algebra involved these opera- 
tors only, and did not discuss the extension of algebraic operators to the attribute 
level [22, 30, 42]. The idea was that, whenever relation- valued attributes are to 
be manipulated, one could first unnest, apply the standard relational operators 
and finally re-nest to obtain the desired result. However it was soon noticed 
that this cannot work in general, since unnest may not be reversible by nesting 
operations; moreover, this is neither an efficient nor a natural way of computing 
the results. 

Subsequent research on algebraic languages for nested relations was then fo- 
cused on investigating the expressive power of the algebra extended with the 
nest/unnest operators and on designing languages that were suited for efficient 
implementation and that allow for the manipulation of complex attributes with- 
out unnesting them first. 

Algebras for manipulating complex attributes have been proposed by many 
authors [2, 17, 21, 29, 39, 40, 41]. In these algebras, ordinary algebraic operators 
are extended to accept nested relations as their operands. Certain algebras de- 
fine set operators (union, difference, etc.) that apply recursively to all complex 
attributes of their operands. 

The approach we follow in this section is to proceed from the more imme- 
diate and straightforward extensions to the more complex and specific ones. 
Specifically, we introduce the following class of extensions to relational algebra: 
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1. Set-theoretic operations and product extended to nested operands; 

2. Nest and unnest; 

3. Operators involving a predicate; 

4. Extended projection; 

5. Extended selection; 

6. Expressions involving nested applications of operators. 

Extension to set-theoretic operations and product. The basic operators 
on sets, namely union, difference, and intersection, are defined exactly as for 
fiat relations: the only difference is that domains of attributes may now be ei- 
ther atomic or set-valued. The set operations are always performed at the most 
external level, that is, on tuples of external names, in other words, we cannot 
perform set operations on tuples of nested attributes. 

Set-theoretic operations usually require operands to have the same domain. 
However, we can relax this requirement on relation schemas: the two schemas 
may also be equivalent (that is, isomorphic up to a renaming), if we give some 
rule to determine the names for the attributes of the result. To this end, we 
can introduce the following rule: when two schemas are equivalent but do not 
have the same attribute names, the result inherits the names of the first schema. 
Alternatively, an explicit renaming operator can be introduced as in the case of 
relational algebra. 

Also the extension of the product is straightforward, since it only involves 
the extension of the schema at the external level. 



Nest and unnest. These two operators produce a result obtained as a modifi- 
cation of both schema and instance of the operand. Informally, nest, denoted by 
V, builds a higher-order attribute within an external relation starting from one 
or more atomic attributes, thus creating a further level of nesting. On the other 
hand, unnest, denoted by /r, deletes a higher-order attribute. When nesting, a 
set is created containing all the tuples of nested attributes having the identical 
values on the non-nested attributes. For unnesting, each tuple of the unnested 
attribute is concatenated with the external tuple containing the unnested at- 
tribute, thus resulting in a sort of tuple-oriented product. As remarked in [30], 
nest and unnest are actually the inverse of each other. However, while an unnest 
can always restore the situation previous to a nesting, the inverse is not in general 
true. 

Unnest. The definition of /r can be formalized as follows. Given a database 
schema S, let r be a relation with schema R in S. Assume B is some higher-order 
name in Er with an associated rule B = (Hi, . . . , Bm). Let {C\, . . . , Ck} = 
Hr — B. Then 






( 72 ) 



is a relation r' with schema R' where: 
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Table 20. Result of the unnesting in Expression (73) 



department 


name 


teachers 0 
age course 


year 











Computer science 


Angelique 


42 


Operating systems 


4 




Angelique 


42 


Software engineering 


4 




Carol 


38 


Databases 


3 




Jerri 


54 


Compilers 


5 




Jerri 


54 


Programming languages 


3 


Electronics 


Patricia 


36 


Computers 


2 




Richard 


45 


Robotics 


5 


Mathematics 


Gregory 


63 


Calculus 


1 




Martin 


42 


Geometry 


2 




Sheila 


40 


Algebra 


1 



1. R' = (Cl, . . . , Ck, Bi, . . . , Bm) and the rule B = (i?i, . . . , Bm) is removed 
from the set of rules in S if it does not appear in any other relation schema; 

2. r' = {t\u& r,t(^Ci,...,Ck) = G U(b)}- 

Example 23. With reference to the nested relation Departments displayed in 
Table 18, the unnesting of collection courses: 

Mcourses (Departments) (73) 

is expected to yield the nested relation whose schema and instance are shown in 
Table 20. Owing to the unnest of courses, the result incorporates one tuple of 
teachers for each department. □ 

Nest. The definition of v can be formalized as follows. Given a database schema 
S, let r be a relation with schema R in S. Let {i?i, . . . and {Ci, . . . ,Cfc} 

= Cr — {Bi,. . . ,Bm]. Assume that B does not occur in the left-hand side of 
any rule in S. Then 

VB = (B^,...,Bra)('^) C^4) 



is a relation r' with schema R' where: 

1. R' = (Cl, . . . , Cfc, i?), where rule B = (i?i, . . . , B^) is appended to the set 
of rules in S'; 

2. r' = {t\u£ r,t(Ci,.„,cG = M(Ci,... ^(b) = {v(Bu-..,b^) I ^ G r, ,c,) 

= t(Ci,...,Ck)}}- 
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Example 24- To show the use of the nest operator, we show how we can gener- 
ate the nested relation Priors displayed in Table 19 starting from the relation 
Priorities outlined in Table 3 as follows: 

^ ^prerequisites — (prerequisite) 

In the result, prerequisite courses relevant to the same course are grouped into 
the complex attribute prerequisites specified in the operator. □ 

Operators requiring a predicate. This class of operators includes selection 
and join. Predicates are more difficult to define in the extended relational model, 
due to the possibility of different nesting depths for the attributes. The problem 
can be illustrated as follows. Assume we have the following database schema: 
R = (A, M), M = (B, C), V = (D, N), N = (E, F), and consider the following operation: 

R Mc=e V . (76) 

It is not clear at which level the tuples of the product of R and V should be 
selected. Specifically, should all the combinations of tuples in the inner collections 
be verified? Or the selection is taking place at the outermost level, selecting those 
tuples whose inner collections agree on the value of attributes C and E for all 
their tuples? 

This semantical ambiguity (arising only in specific comparisons among at- 
tributes) depends on the (so called) quantification level of the attributes involved 
in the predicate. The general solution to this problem will be presented later. At 
the moment, we confine ourselves to extending the form of predicates in order 
to account for set-valued nested attributes. Thus, in this section, we consider 
selections and joins whose predicate contains attributes belonging only to the 
outermost level, that is, to the schema rule of the external name involved in the 
operation. 

Extended propositional formula. Let i? be a relation schema. A propositional 
formula p over Ar is defined recursively as follows. Atoms over Ar have the 
form A 1 OA 2 or Aida, where both Ai and A 2 are in Ar, a is a constant, which 
can be set-valued, and 6* is a comparison operator, that is, 9 € {=,<,>, > 

,<,D,D,c,C,g}. Every atom over Ar is a propositional formula over Ar; if 
pi, p2 are propositional formulas over Ar, then ^(pi), pi A p2, and pi V p2 are 
formulas over Ar. Parentheses can be used as usual. Nothing else is a formula. A 
propositional formula associates a Boolean value with each tuple in the instance 
r of R. 

Selection. Given a relation r over the schema R, the selection of r with respect 
to p, denoted by Op{r), is a relation over the same schema R, containing the 
tuples of r that make p true: 

{t G r I p{t)} . (77) 

The only changes introduced with respect to relational algebra are set compari- 



son. 
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Example 25. With reference to the nested relation Priors displayed in Table 19, 
to find out the courses which require Programiming languages as a prerequisite 
we write: 



^‘Programming languages’ C prerequisites (P^fors) . 

where ‘ProgrEunming languages’ is a constant. This query will select the fourth, 
sixth , and seventh tuple of Priors. □ 

Join. The theta-join between nested relations can be defined based on Equiva- 
lence (10), that is, as a selection of a product: 

ri ixip V2 = CTp(ri X ra) . (79) 

where rq and T 2 are nested relations and p an extended propositional formula, 
as defined above. The extension of the natural join to nested relations comes 
without any surprise, once considered that the implicit predicate of equalities 
between homonymous attributes may in general involve complex attributes. 

Extended projection. The operators introduced so far, with the exception of 
nest and unnest, are a direct extension of relational algebra, but their expressive 
power is inadequate, since they lack the capability of accessing inner collections. 
In order to perform even simple queries, a lot of nesting and unnesting is therefore 
needed. 

Example 26. Considering Departments in Table 18, to select the departments 
which refer to (at least) a course of the last two years, we cannot access directly 
attribute year in the selection predicate since it belongs to the inner relation 
courses. Thus we need to unnest in cascade courses and teachers so as to 
apply the selection on the resulting flat relation: 

’^department (^year >4 (/^teachers (/^courses (department s) ) ) ) (^0) 

□ 

The solution to this problem is to further extend relational operators by allowing 
the manipulation of inner collections. We first introduce projection so as to make 
it possible to project also on inner collections. Then we extend set operations so 
that they can perform union, difference, and intersection of inner attribute 
Given an external name R, consider the set S' of rules corresponding to its re- 
lation schema. The projection ttai,... ,a„ (R) defines the projection list A\,. . . , A„: 
each name Ai must occur in the right-hand side of just one rule in S' . This is 
called the uniqueness constraint. The result of the projection has a relation 
schema S" obtained by replacing the rules of S' with their projection on the 
projection list as follows: 

1. Include in S" the rules of S' that contain in their right-hand side one or 
several names of the projection list, limiting the right-hand side to names of 
the projection list; 
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2. Include in S” the rules of S' that have on the left-hand side a name appearing 
in the rules of S" that does not already occur on the left-hand side of a rule 
in S"; 

3. Include in S" the rules of S' that contain in their right-hand side a name 
appearing in S", limiting the right-hand side to names appearing in S"; 

4. Apply Steps 2 and 3 until no more rules are added to S" . 

The instance r' of the result is obtained from the instance r of i? by projecting 
each tuple on the new relation schema S" 

We have a problem with the formalism: the ‘projected’ rules are valid for the 
result of the projection, but they are also used in the operand schema, so we 
should keep also the original, non-projected version. The simplest solution is to 
assume that, as in the flat relational model, each algebraic operation builds a 
temporary (nested) relation, that can be used as the operand of another opera- 
tion or stored as a persistent relation. In both cases, the schema of the relation is 
defined by new rules, that are added and not substituted to the original version. 
Names should be changed accordingly, so as to respect the uniqueness constraint. 

Example 21. With reference to Departments outlined in Table 18, the following 
projection: 



^department , name, courses (Departments) 



(81) 



will result in the nested relation shown in Table 21. 



□ 



Extended selection. In our previous version of the selection operator of nested 
relational algebra, selection was only possible at the outermost level, the level 
of tuples of the external relation. Informally, in order to be able to use inner 
attributes in the predicate, we need a way to decide: 

1. On which collection the selection is performed; 

2. Which values should be compared in a predicate atom. 

We first formalize the concept of a quantification level, informally introduced 
above: constants and external names have quantification level 0 (that is, there 
is only one instance for them in the database); the quantification level of each 
name occurring in the right-hand side of a rule, for which the left-hand side 
name has quantification level i, is i -I- 1. 

We now extend the notion of an atom in a selection predicate. Let i? be a 
relation schema; atoms over have the form Ai9A2 or Ai9a, where a is a 
constant, 0 is a comparison operator, 9 e {=, <, >, yf, >, <, D, D, c, C, g}, and 
A\, A 2 are names occurring in the relation schema such that: 

(a) The quantification level of A\9a is that of the parent of A\ and the atom 
refers to its tuples; 

(/3) If Ai and A 2 are siblings (they have a common parent) in Gr, (the schema 
tree corresponding to R), then the quantification level of A\9A2 is that of 
the common parent, and the atom refers to its tuples; 
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Table 21. Result of the extended projection (81) 



department 


name 


teachers 0 

coursesO 

course 


year 





Computer science 



Electronics 



Mathematics 



Angelique 

Carol 

Jerri 



Patricia 

Richard 


1 Computers 


2 1 


1 Robotics 


5 1 






Gregory 


1 Calculus 


1 1 


Martin 


1 Geometry 


2 1 


Sheila 


1 Algebra 


1 1 









Operating systems 4 

Software engineering 4 

I Databases 3 | 

Compilers 5 

Programming languages 3 



(7) If a sibling of Ai is an ancestor of A 2 (or vice versa) in Gr, then the quan- 
tification level of A 10 A 2 is that of the parent of A 2 (respectively Ai) and 
the atom refers to its tuples. 

According to the above axioms, external constants and external names are con- 
sidered as siblings of any external name (the quantification level is 0). 

Propositional formulas of predicates are built by means of atoms. For each 
atom involved in a propositional formula, we consider the tuples to which the 
atom refers. Operands of binary logical connectives must be atoms or other 
formulas whose corresponding tuples still obey to the above axioms. The logical 
connective will refer to tuples of the lower quantification level, unless they belong 
to the same parent collection. 

Therefore, we have given axioms to recursively determine the quantification 
levels and tuples to which the predicate refers; the selection is performed only 
on these tuples. 

Example 28. Based on the above axioms, the following extended selection: 

f^department— ‘Computer science’ A age>4o(bepart:iIierit:s) 

will (somewhat surprisingly) result in the nested relation displayed in Table 22. 
With reference to Axioms (a), (/3), and (7), the quantification level of the selec- 
tion predicate p within Expression (82) is computed as described in Table 23. 
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Table 22. Result of the extended selection (82) 



department 


name 


teachersO 

age courses () 

course 


year 





Computer science 



Angelique 

Jerri 



Electronics 

Mathematics 



42 



54 



Operating systems 4 

Software engineering 4 



Compilers 5 

Programming languages 3 



Table 23. Steps to yield the quantification level of the selection predicate p in (82) 



Predicate 


Description 


Axiom 


Quantification level 


pi 


department = ‘Computer science’ 


(a) 


Departments 


P2 


age > 40 


(a) 


teachers 


P 


Pl A p2 


( 7 ) 


teachers 



Accordingly, tuples of teachers are selected based on the value of predicate p. 
Within Table 22, the nested attribute teachers is empty for both departments 
Electronics and Mathematics, because every tuple of them does not satisfy 
predicate pi and, consequently, p. Besides, removing a tuple from teachers 
causes the removal of all its attributes, specifically, courses. □ 

Note that, generally speaking, when the quantification level of a selection is in- 
ternal, that is, the selection is applied to an internal relation i? (in Example 28, 
R — teachers), duplicates may be generated among the tuples R belongs to 
(in Example 28, the tuples of Departments). In fact, in this case, the selec- 
tion operates as a modification of attribute values, which is bound to generate 
duplicates. 

Expressions involving nested applications of operators. Since operands 
are nested relations, it is worth nesting the operators. For example, the selection 
predicate could consist of a comparison between the results of the selections 
on two inner attributes. This way, relational operators can be applied to inner 
attributes as well, thereby extending the expressiveness of the algebra. 

Nested expressions are used also in languages for the flat relational model: 
for example, SQL allows nested queries, that are generally used as an alternative 
to joins between relations. When the structure of relations is more complicated, 
the need for nested expressions becomes a natural requirement. 
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Nested expressions in the context of extended algebras were first studied in 
[29] (where they are called recursive expressions) and [41]. 

Essentially, nested algebraic operations can be defined on the basis of the 
following principles: 

1. A relational expression can occur wherever a relation name is needed (this 
was implicitly assumed in some of our previous examples). 

2. A relational expression can occur wherever an attribute name is expected. 

Example 29. To find out the departments that include (at least) a teacher younger 
than 40 we may write: 



"^department (^(<Tage < 4 o(teachers)) 5^0 (Departments)) (83) 

In Expression (83), a selection is applied to Departments, whose predicate in- 
volves a complex comparison between a nested selection on teachers and the 
empty set. Intuitively, each tuple of Departments is selected if and only if the se- 
lection of the corresponding attribute teachers based on the simple comparison 
age < 40 yields at least a tuple, in other words, when the department includes 
a teacher younger than 40. This holds for Computer science and Electronics 
only, which are in fact the departments displayed after the final projection. □ 

Example 30. To find out the departments that include (at least) a teacher older 
than 50 and teaching a course of the last two years we may write: 

^department ^0 (Departments)^ 

In Expression (84), a selection is applied to Departments, whose predicate in- 
volves a complex comparison between a nested selection on teachers and the 
empty set. Besides, the nested selection on teachers involves a complex com- 
parison between a nested selection on courses and the empty set. The latter 
selection is based on the simple comparison year > 4. This way, the selection 
operation is propagated through the nested schema of the operand Departments. 
The final result is expected to include the singleton {(Computer science)}. □ 



5 Multisets in Nested Relational Database Systems 

In Sect. 3, we showed how the relational model may be extended to deal with 
multisets. In particular, in Sect. 3.1, the operators of relational algebra have 
been extended to manipulate multisets. Then, in Sect. 4 we showed how the 
relational model can be extended to incorporate nested relations and how these 
nested relations can be manipulated by means of a nested relational algebra. 

It is natural at this point to combine the two extensions, namely multisets 
and nested relations, into a new data model in which multisets can be nested 
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within complex relations. Intuitively, a complex relation is like a nested relation 
incorporating both sets and multisets^. 

We may easily extend the formal definition of a nested relational database 
schema given in Sect. 4 to a complex relational database schema by making rules 
polymorphic, that is, a rule R may be either 



R — (i?i, i?2, ■ • 


■ ,Rn) 


(85) 


R = [^l7 R2, ■ ■ 


■ ,Rn] 


(86) 



where, intuitively. Rule (85) defines a set, while Rule (86) a multiset. 

Example 31. Shown in Table 24 is a complex relation called Products, whose 
schema is defined by the following rules: 

Products = ( product, components, time ) , , 

r , (0‘) 

components = [component] 

Each product is associated with a multiset components^ and a time needed to 
assemble the product by means of the collection of components. Thus, according 
to the simplified information stored in Products, a car is composed of a body, an 
engine, four occurrences of wheel , two occurrences of light, and a tauik. Then, 
each component is itself a product, possibly composed of other components, and 
so on, until reaching atomic components, such as light or tank. □ 



5.1 Nested Relational Algebra for Complex Relations 

Intuitively, we may define a nested relational algebra for complex relations by 
combining the concepts inherent to the relational algebra for multisets intro- 
duced in Sect. 3.1, and the relational algebra for nested relations presented in 
Sect. 4.1. In fact, there is almost nothing surprising in integrating the two ap- 
proaches into a uniform operational framework. Among other papers, the book 
[10] presents an extensive description of a nested relational algebra for com- 
plex relations. However, in this section, we only focus upon a limited variety of 
computational aspects relevant to the manipulation of complex relations. 



Type transformation. Generally speaking, a complex relation is polymorphic 
in nature, as it may be either a set or a multiset and include both set and multiset 
attributes. For example, the complex relation Products shown in Table 24 is a 

^ Complex relations may include other kinds of collections, like lists and arrays. How- 
ever, according to the scope of this paper, we confine our discussion to multisets 
only. 

® Within the schema, the name of a multiset, such as components, is followed by square 
brackets to distinguish it from a set. 
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Table 24. Instance of the complex relation Products 



product 


components [] 
component 


time 




car 


body 


32 




engine 






wheel 






wheel 






wheel 






wheel 






light 






light 






tank 




body 




0 


engine 


block 


18 




block 






carburetor 




block 


cylinder 


26 




piston 






valve 




wheel 


tyre 


4 




tube 




light 




0 


tank 




0 


carburettor 




0 


cylinder 




0 


piston 




0 


valve 




0 


tyre 




0 


tube 




0 



set, but the internal collections relevant to attribute components are multisets. 
That is, the schema of Products involves both set and multiset constructors. 
Since these two collection types may coexist within the same complex relation, 
it makes sense to define an operator which allows us to change the set into a 
multiset and vice versa®. 



Such a change, however, must always be interpreted within a functional framework, 
where operands never change their state but, rather, a new complex relation is gen- 
erated with the changed schema. 
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Let r be a complex relation, and £ = Ai, A„ a (possibly empty) set of 
attribute names of r. Then, the following expression: 

r^(r) (88) 

denotes the type transformation of r, which results in a complex relation r' such 
that: 

1. If £ = 0, then the type of r' is changed with respect to the type of r; 

2. If £ 7 ^ 0, then the types of attributes of v' in i are changed with respect to 
the corresponding types in r; 

3. The instance of r' is obtained from the instance of r in accordance with the 
type transformations performed in Points I and 2. 

Example 32. With reference to Table 24, to generate a new complex relation 
obtained by transforming Products into a multiset we may write: 

r(Products) . (89) 

Instead, to change the nested multiset components into a set we write: 

^components (P^^ducts) . (90) 

Consequently, the duplicates originally contained in multiset components (see 
Table 24) will be removed from the result. □ 

Type transformations are not only useful but even necessary to specify queries 

appropriately. In set-theoretic operations, since we require the equality of the 
operand schemas, a renaming might be in general not sufficient if one operand 
is a set and the other a multiset. Moreover, we require as well that product and 
join operate on complex relations of the same type. Thus, type transformation 
might be needed in this case too^. 



Nest and unnest. Nest and unnest operations were introduced in Sect. 4.1 
for nested relations. Instead of extending such operations formally, we prefer 
providing some intuitive guidelines. 

Unnest. The unnest operation creates a complex relation by reducing by one 
level the depth of the operand schema tree. Given an instance r of a complex 
relation schema R defined as follows: 

i?(.4i,... ,A„,/3(Bi,... ,B„)) (91) 

^ Another viable approach might be to have implicit casting rules (as in general- 
purpose programming languages in which, for example, an integer is transformed 
into a real), where the type of a complex relation is automatically transformed into 
another type based on the specific computational context. 
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Table 25. Result of Expression (94) 



product component time 



car 


body 


32 


car 


engine 


32 


car 


wheel 


32 


car 


light 


32 


car 


tank 


32 


engine 


block 


18 


engine 


carburetor 


18 


block 


cylinder 


26 


block 


piston 


26 


block 


valve 


26 


wheel 


tyre 


4 


wheel 


tube 


4 



where angles denote either set or multiset collections, the following operation: 

ixnir) (92) 

generates a complex relation r' with the following schema R'\ 

R' {Ai, . . . , Am, B\, ■ ■ ■ , Bn) (93) 

Since more than one tuple on _Bi, . . . , exists in R for each tuple Ai, . . . , Am, 
r' will be built joining the tuples together. The type of R' is determined from 
the type of R and not from the type of f3. Accordingly, the tuples resulting from 
the concatenation of the tuples of R and (3 will be considered as belonging to a 
complex relation of the type of R. Two cases are possible: 

1. i? is a set: the multiple occurrences of the tuples on (3 will be reduced to one; 

2. i? is a multiset: the multiple occurrences of the tuples of (3 will generate 
multiple occurrences in r' too. 



Example 33. With reference to the complex relation Products shown in Ta- 
ble 24, the following operation: 

^components (P^^ducts) (94) 

will result in the (flat) set displayed in Table 25. Note that the unnesting of 
empty collections give rise to the removal of the corresponding tuple. □ 

Nest. Given an instance r of a complex relation schema R defined as follows: 

R{A \, . . . , Am, B\, . . . , Bn) (95) 

where each A^ and Bi can be a complex attribute, the following operation: 

J^/3=(Bi,...,B„>(t) (96) 
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generates a complex relation r' with the following schema: 

,B„)) . (97) 

The instance of r' is built as follows: tuples having the same projection on 
attributes Ai, . . . , Am are grouped together and for each group we consider the 
corresponding parts Bi which form a set or a multiset, according to the type of 
[3. Two cases are possible: 

1. i? is a set: only one tuple is maintained in r' , which is obtained through the 
concatenation of the Ai, i G [1 ..m], and /3; 

2. i? is a multiset: a tuple of r' is created for each tuple of r; within the tuples 
of the same group the part (3 is repeated. 



Example 34- Shown in Table 26 are a multiset Flat (left) and the result of the 
following nest operation (right): 



^components— (component) (Flat) 



(98) 



Expression (98) generates a complex relation of type multiset with a nested 
set called components. Therefore, according to the above rules, for each tuple 
in Flat, a tuple is created in the result, where, within the tuples of the same 
group, the instance of the newly created attribute is repeated. Note, however, 
that, since components is defined as a set, duplicates are removed from the 
nested collections. □ 



Table 26. Multiset Flat (left) and result of Expression (98) (right) 



product 


component 


product 


components 0 
component 


car 


wheel 






car 


wheel 


car 


wheel 


car 


engine 




engine 


engine 


block 






engine 


block 


car 


wheel 


engine 


carburetor 




engine 






car 


wheel 








engine 






engine 


block 








carburetor 






engine 


block 








carburetor 






engine 


block 








carburetor 
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Table 27. Result of Expression (101) 



component 

body 

engine 

wheel 

light 

tank 

block 

carburetor 

cylinder 

piston 

valve 

tyre 

tube 



Complex aggregate operations. We have already introduced the notion of an 
aggregate operation in Sect. 3.4 within the context of SQL. Considering complex 
relations, it is possible to introduce new kinds of aggregate operations that are 
applied to collections of complex (rather than simple) values. Specifically, these 
are complex union and complex intersection, denoted by U'^ and n'^, respectively. 

Let r be a complex relation involving an attribute A, either of type set or 
multiset®. The complex union of A within r is defined as follows: 

Ua (0 = Ui6. (f(A)) • (99) 

Likewise, the complex intersection of A within r is so defined: 

(r) nter (t(A)) • (100) 

Note that Definitions (99) and (100) can be generalized to deal with an attribute 
A which is at any level within the schema of r, not necessarily at the level of the 
tuples of r. 

Example 35. With reference to the complex relation Products displayed in Ta- 
ble 24, we might ask for the set of products that are components of some other 
product as follows: 



'r(Ucomponents(ProduCts)) (101) 

which gives rise to the set displayed in Table 27. Since both union and intersec- 
tion of multisets result in a multiset, the final type transformation is needed in 
order to remove duplicated components. 

Note that, if we replaced in (101) U'^ with n'^, we would obtain the empty 
set. □ 



Attribute A can be itself complex, that is, it may be defined in terms of other complex 
attributes. 




On Multisets in Database Systems 



197 



function FixedPoint(r): relation resulting from ip[A <— J-p{A,r)-, Tc{A,r)](r) 

input 

r: the operand relation; 
begin 
r' := r; 
stop := false; 
repeat 

A:=Mr'y, 
if J-p{A, r') then 
stop := true 
else 

r' := Tc{A,r') 
until stop; 
return r' 
end. 

Fig. 3. Semantics of the fixedpoint operation defined in Formula (102) 

Fixedpoint operation. An interesting extension to the algebra for complex 
relations is the fixedpoint operator, which allows recursive queries to be expressed 
in an algebraic style without the use of control structures or recursion. Such an 
operation can be seen as a generalization of the closure operation introduced in 
various advanced query languages. 

The fixedpoint operation applies repeatedly a function Tt, called the trans- 
former, to a complex relation r, combining the result with r using another func- 
tion Tc, called the combiner, which generates the new relation to be used in place 
of r in the next iteration. The loop continues until the predicate becomes true. 

From the algebraic point of view, the fixedpoint operator, denoted by ip, is 
a unary operator computing a result which has the same schema of the operand 
(but, in general, different instance). The generic form of the fixedpoint operation 
applied to a complex relation r is the following: 

ip{A^T,(r)- .Fp(Ar); T,{A,r)\(j) (102) 

where the involved functions Tt, Tc-, and iFp are specified within square brackets, 
between the operator symbol and the operand. The semantics of the fixedpoint 
operator can be specified by the pseudo-code of a FixedPoint function, which 
takes as input the operand r and computes the result r' , as illustrated in Fig. 3. 
The meaning is the following: r is the name of the (p operand and r' is a copy of 
r. First, r' is taken as the argument of the transformation function Tt (possibly 
with other arguments) that calculates the partial value A. Before going to the 
combination function, the predicate iFp(A,r') is calculated. This predicate may 
possibly contain further arguments beside A and r'. If the predicate evaluates 
to true, the loop is broken and the current instance of r' is the result. 

The quantification level of the predicate iFp must be related with the quan- 
tification level of the operand according to the axioms given in Sect. 4.1. If 
the predicate iFp(A, r') evaluates to false, the result of the combination function 
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Table 28. Query result of Example 36 



product 

cylinder 

cylinder 

cylinder 

cylinder 

piston 

piston 

piston 

piston 

valve 

valve 

valve 

valve 

carburetor 

carburetor 

tyre 

tube 



iFc(Z\, r'), possibly with further arguments, is assigned to r', and the body of the 
loop is repeated When the loop terminates, the instance of r' is the returned 
result. 

Example 36. Consider the complex relation Products displayed in Table 24. We 
might be interested in finding out the multiset of atomic components needed to 
assemble a multiset of given products. For example, to assemble two engines we 
need four cylinders, four pistons, four valves, and two carburetors. 

The multiset containing the products for which we ask atomic components 
is called Input, which is characterized by a single attribute called product. For 
example. Input might include two engines and one wheel. Here is our query: 

Input ^ [(engine), (engine), (wheel)] 

Lp[ Comps pproduct(U^omponents (Input N (T(Product s) ) ) ) ; 

Comps = 0; 

Comps U (7rproduct(CTcomponeiits=0( Input M (r(Products))))) 

] (Input) 



The first statement simply instantiates the multiset Input. The actual query is 
expressed by the next expression, where the fixedpoint operator is applied to 
Input. According to the terminology introduced in Formula (102), the trans- 
former first makes a natural join between Input and the multiset mirror of 
Products, then makes the complex union of the relevant components, and fi- 
nally performs a renaming of the result. This way. Comps is expected to contain 

® Note that the function will not terminate if the condition Tp{A,r') never evaluates 
to true. 
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the multiset of components (at the first level) of the products in Input. The 
predicate tests for the emptiness of Comps. Then, the combiner makes the union 
of Comps and the atomic products contained in the current instance of Input. 
The result of if is displayed in Table 28. □ 

6 Related Work 

Various database systems and query languages support multisets in the relevant 
data model [15, 47, 28, 45, 36, 44, 23, 12, 10]. For instance, a query written in 
DAPLEX [45] supports a flexible control over creation and elimination of du- 
plicates in the query result. Therefore, starting from the 1980s, multisets were 
proposed as an additional type constructor in order to formally deal with dupli- 
cates in databases. 

Some research has been devoted to extending the set-theoretic operations to 
multisets, as well as to developing techniques for algebraic query optimization 
for data models supporting multisets. The work presented in [20] extends the re- 
lational model to include multiset relations, i.e. relations with duplicated tuples. 
A framework for query optimization is also defined. Both the operands and the 
results of queries are in general multiset relations. 

A different approach to formalize multisets in relational databases can be 
found in [31], wherein, at the conceptual level, a relation with duplicates, called 
multirelation, is conceived as a subset of the columns of a larger relation with no 
duplicates, by keeping hidden those columns that yield the uniqueness of all the 
tuples in the relation. Consequently, relations or views with duplicated tuples 
are not allowed at the conceptual level, and there is no need for the introduction 
of any explicit multiset semantics. 

In [27] it is investigated how the use of multisets in the nested relational 
algebra extends its expressive power and increases its computational complexity. 

In [49] a framework for algebraic query optimization is developed in the 
context of an object-oriented data model that supports multisets. In the quoted 
work a number of specific algebraic rules for query transformation are presented. 
The emphasis is posed on rules for manipulating the instances of types in a super- 
type/sub-type lattice, with complex type constructors. Union, intersection, and 
difference are defined for multisets, and some transformation rules for these 
operators are given. 

The work described in [38] deals with recursion and aggregates according to a 
proposed declarative multiset semantics for deductive databases. Such databases 
have been enriched with aggregate operators and predicates that may return mul- 
tisets of tuples. A formal basis for efficient evaluation of queries when multisets 
are generated as intermediate results in a query evaluation process is provided. 

6.1 Multisets and Views 

Given a relational database, views are data derived from it that can be mate- 
rialized (that is, stored permanently in the database) and subsequently queried 
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against. The use of views in order to answer queries is common to different 
database areas and it is very relevant in data warehouse applications [32]. In 
fact, in such applications, queries against data typically involve the computa- 
tion of aggregate operations, which, as known, have to handle duplicated data 
to be correctly evaluated. 

In [46] the problem of transforming an SQL query that includes grouping 
and aggregation into an equivalent one, that can be answered more efficiently, 
is considered. Such a rewriting has to fulfill two requirements: 

1. The rewritten query has to be posed against available materialized views 
(conditions under which this is possible are given); 

2. The multiset semantics of queries has to be preserved, that is, the rewritten 
query computes the same multiset of tuples for any given database. 

The problem of view maintenance occurs when some of the base relations change 
and, consequently, materialized views must be recomputed to ensure that an- 
swers to queries posed against the views be correct. The recomputation of a 
view, after a change has occured in base relations, is usually computationally 
expensive. An alternative approach to complete recomputation consists in de- 
termining the changes that must be applied to the view, given the changes to 
base relations and the expression that define the view. In [25] an algorithm for 
view maintenance that follows this alternative approach is proposed. Views are 
regarded as multisets of tuples defined by means of expressions of a multiset 
algebra. The algorithm for view maintenance gives solutions that satisfy some 
minimality requirements, and, moreover, that are proven to be correct with re- 
spect to the aggregate operations computed over the changed view. 

6.2 Multisets in Web Information Discovery 

Weh data mining [35] is defined as the discovery and analysis of useful knowledge 
from Web data. An approach to Web data mining is proposed in [7], according 
to which queries are posed against the Web by specifying an entry point Web 
address and some conditions about paths and data attributes. Once retrieved, 
data matching the query are stored in a special relation. Each tuple of such a 
relation consists of interconnected directed graphs, which represent Web docu- 
ments and hyperlinks. In order to isolate the columns of interest from a table, a 
Weh projection operator can be used, which does not eliminate identical tuples 
automatically. The result of a Web projection is a Web bag, that is a relation that 
may contain multiple occurences of the same tuple. A few differences between 
Web bags and multisets in relational databases can be identified: 

1. Different nature of data: a Web bag contains unstructured data derived from 
the Web, while each piece of data in a multiset within a relational database 
has a known domain; 

2. Different purposes: Web bags are useful for discovering knowledge, while 
multisets in relational databases mainly focus on efficiency and on semantics 
of aggregate operators; 
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3. Different origin: Web bags can only be obtained as the result of a projection, 
while multisets in relational databases can be obtained also as the result of 
other operations. 

The exploitation of Web bags for discovering useful knowledge from Web data 
is discussed in the same contribution. 

6.3 Query Languages for Multisets 

In this section we will describe in some detail two meaningful works in the area 
of database query languages for multisets. The first [5] is a conference paper that 
historically opened the way to a research stream. The second work [33, 26, 34], 
carried out over several years, moved many further steps in this stream. In both 
works the name hag is preferred rather than multiset and we will adhere to this 
preference. In each description we will retain the notation used by the authors of 
the considered works. Different authors adopt different notations, that is, they 
may use different symbols for denoting the same operator or, vice versa, they 
may use the same symbol for denoting distinct operators. Thus, when faced with 
algebraic properties, the reader has to recall the context-specific meaning of the 
involved operators. In order to help the reader, while providing any operator 
definition given in a work, we have explicitly stated the correspondence of the 
current operator with operators defined by other authors and/or introduced in 
previous sections. 



Algebraic properties of bags. Albert [5] defines some bag operations and 
investigates to what extent the typical algebraic properties of set operations are 
obeyed when bags are assumed as operands. The proposed semantics of operators 
agree with [20]. Nested bags are not dealt with. The ultimate goal is to provide 
a formal basis for database query optimization. 

Constants. The notation [ list of elements ] is used for representing a bag, where 
each element belongs to a countable set of primitive objects. The symbol 0 
denotes an empty bag. 

Membership and multiplicity. The expression x & B denotes that bag B contains 
X, and X SS B represents the number of copies of x in B. li x & B, then 
X GG B > 0, and vice versa. For example, given bag B = [a, a, &, 6, 6], both 
a G B and b G B hold; in particular, a GG B equals 2, and b GG B equals 3. 

Containment. Given two bags A and B, by definition 

1. AC B means that (Vcc G A) ((x GG A) < (x GG B)); 

2. A C B means that AC B and Ay^B. 

In either case, A is a subbag of B. A theorem is proven, according to which C, 
as defined above, is a partial order relation (i.e. it is an irrefiexive transitive 
relation) on any set of bags. 
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Powerset. Given a bag A, by definition, the powerset of A is 

V{A) {B\B CA} . (103) 

Union and intersection. Let A and B be bags. By definition 

1. A U B is the smallest bag C such that ^ C C and B C C; 

2. A n B is the largest bag C such that CCA and C C B. 

The notion of largest and smallest used in the above definitions refer to the 
partial order relation C. 

The definitions of U and H yield the usual set union and intersection when 
restricted to sets. Besides, these operations, as defined on bags, share the same 
algebraic properties as the standard set-theoretic ones. This is proven by means 
of a theorem which states that, given a bag U, {V{U),U, n) is a distributive 
lattice, called subbag lattice, induced by the partial order C. This means that, 
for the binary operations U and D, as applied to any possible subbag of a given 
(universal) bag U, all the axioms of Boolean algebra wherein complement is not 
referenced hold, with U corresponding to one and 0 to zero. 

Note that the union operation, as defined above for bags, is neither that 
defined in Sect. 3.1 nor that adopted by SQL for multisets. As to the intersection 
operator, it corresponds to both the intersection as defined in Sect. 3.1 and the 
SQL operator INTERSECT ALL (see Sect. 3.4). 

By means of a theorem, it is shown that the above operations are well-defined, 
that is, for any given pair of bags, there is a unique bag satisfying the definition 
of union/intersection of such bags. In particular, given three bags A, B, and C, 
for union it is proven that 

1. X AU B = m.a,x{x GG A,x GG B); 

2. ACCandBCC^AcBCC. 

Likewise, for intersection it is proven that 

1. X GG An B = min{x GG A,x GG B); 

2. C C A and C C B ^ C C An B. 

Concatenation. Given two bags A and B, and the countable set of primitive 
objects Obj that is the domain of any element of A and B, by definition the 
concatenation of A and B, AU B, is the bag C such that 

xGGC ={xGG A) + (x GG B) . (104) 

This operator becomes the usual set union when bags are turned into their set 
counterparts. SQL use the keyword UNION ALL for this operator. 

Difference. Let A and B be bags, and Obj the countable set of primitive objects 
that is the domain of any element of A and B. By definition, the difference A\B 
is the bag CCA such that 

(Vx G Obj) {x GG C = max((x GG A) — (x GG B),0)) . (105) 
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This operator matches the usual notion of set difference when restricted to sets. 
It corresponds to the difference operator for multisets defined in Sect. 3.1 as well 
as to the EXCEPT ALL operator of SQL (see Sect. 3.4). However, taking any bag 
U that is not a set as a universe, this operator, if used as a unary operator, does 
not induce a complement operator relative to this universe. Operator \ would 
be a unary complement operator if it would satisfy the following two axioms: 



(VH e V{U)) {Au{\A) = U) 
{\/AeV{U)) (Hn(\H) = 0) . 



(106) 



However, these two axioms does not hold for operator \ as defined above. Still 
more, Albert demonstrates a theorem according to which no such an operator 
exists and, therefore, looking for a different semantics for bag difference cannot 
solve the problem. 

Given a complement operator defined on a universe and two operators that 
form a distributive lattice on the same universe, the three operators altogether 
are a Boolean algebra. Thus (7^(17), U, D, \) is not a Boolean algebra for bags, 
whereas it is for sets. Therefore, while all the properties of a Boolean algebra 
which involve the difference operator hold for sets, there are some of them which 
fail for bags, as illustrated in the following example, taken from [5]. 

Example 37. Let’s consider the following algebraic transformations inherent to 
the set-theoretic operations of union, intersection, and difference: 



A\{B\JC)= Ar\{B\JC)= Ar\{Bf]C) = {Ar\B)f]C ={A\B)\C . 

(107) 



The equivalence of set difference and the intersection with the complement has 
been exploited in the first and last transformations, in the second transformation 
De Morgan’s law has been applied, and, in the next one, the associativity of 
intersection. 

The identity obtained between the initial and final expressions holds if A, 
B, and C are sets, however, in general, it does not hold for bags. In fact, let’s 
suppose, for instance, that A = [x,x,x,x], B = [x,x], and C = [x\. Then 
A \ (H U C) = [x, x], while {A\B)\C = [xj. □ 



Duplicate elimination. A bag B is transformed into a set by function 6. By 
definition, 6{B) = {x \ x G B}. 6 looks like the opposite function of the mirror 
function 9Jt defined in Sect. 3.2. 



(Boolean) selection. Let A be a bag, Obj the domain of its elements, and ip a 
quantifier-free predicate, either atomic or built up from a set of atomic pred- 
icates, whose domain includes i5(A). The selection of A based on ip selects all 
and only the elements of A which satisfy the predicate ip, that is, by definition, 
cr.^{A) is the bag C such that 

. ( 108 ) 




204 Gianfranco Lamperti, Michele Melchiori, and Marina Zanella 



This operator is the same as the selection operator for multisets defined in 
Sect. 3 . 1 , which can be expressed in SQL by means of the select-from- where 
paradigm (see Sect. 3 . 4 ). 

A theorem [5] states that, given any predicates ip and tp whose domains 
include 5(A), the following equalities hold: 

f^y?V(/7(A) (j,^(A) U (j(^(A) 

f^i/j(A) n (j(^(A) 

a^^p{A) = A\a^p{A) ( 109 ) 

S{<7^{A)) = acp{S{A)) 

X GG CT(p{A) = (x GG A) ■ (x GG cr^(5(A))) . 

The first three properties above imply that the semantics of V, A, and ^ with 
respect to Boolean selection correspond to U, fl, and \, respectively. 

The next two properties formally state the all-or-nothing semantics of the 
selection operator, that is, either all copies of some x G A are selected, or none. 
The fourth property exhibits a slight abuse of notation on the right-hand side 
since the Boolean selection operator defined over bags is applied to a set. Indeed, 
Albert makes no clear distinction between a set and a bag containing just an 
occurrence of everyone of its element. 

Albert proves a theorem stating that, with respect to selection operations, 
bags behave like sets. Formally, let Ei and E 2 be two compositions of selection 
operators, that is, E\ = o • • • o and E 2 = o • • • o Oip^, . If, for 
every set A, Ei{A) = E 2 {A), then also Ei{B) = E 2 {B) for every bag B. 

Complement. Let 17 be a bag and Bred the smallest set of predicates that 
includes a given set of atomic predicates, and is closed under the propositional 
connectives. Let S{U) be the set of subbags of U obtained by applying any 
selection operator to U, that is, 

S{U) = {ACU :{3ipG Pred){A = ap,{U))} . ( 110 ) 

The unary complement operator — is defined on S(U). By definition, for any 
A G S{U), —(A) = U\A. Albert proves that {S{U), Li, D, — ) is a Boolean algebra. 

Operators and query languages. By means of the following properties, Albert 
proves that U and \ are sufficient to obtain U and n. 

AUB = {A\B)UB 
AuB = {AuB)\{AnB) 

AnB = A\{A\B) ^ ^ 

Af]B = (AUB)\{AU B) . 

The converse, instead, does not hold, that is, neither U nor \ can be constructed 
out of the three remaining operations, as stated by means of two theorems. 

The implication of these results is that U and \ are strictly more general 
operations than U and n. This justifies the presence in SQL of UNION ALL and 
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EXCEPT ALL, corresponding respectively to operators U and \, along with the 
absence of any operator corresponding to bag union U and intersection H (as 
defined by Albert). 

Since a Boolean algebra U, D, — ) has been defined on the set of the 

results of any Boolean selection as applied to some universal bag, the usual 
algebraic properties for sets hold within this domain. In particular, the algebraic 
transformations that are applied to sets for query optimization keep being valid 
when the operands are bags which are formed by applying a Boolean selection 
to some universal bag. 



Properties of query languages for bags. The research described in [33, 26, 
34] focuses on how database query languages are affected by the use of duplicates. 

Many query languages have traditionally been developed based on relational 
algebra and relational calculus. Relational algebra is the standard language for 
sets, and (in its ffat form) an algebraization of first-order logic. However, struc- 
tures such as bags, along with nested structures in general, are not naturally 
supported by first-order logic-based languages. Thus, in the work at hand an 
approach to language design was exploited [11] that, instead of using first-order 
logic as a universal platform, suggests to consider the main type constructors 
(such as sets, bags, and records) independently and to turn the universal proper- 
ties of these collections into programming syntax. This approach was adopted for 
the development of a query language with the aim of investigating the theoreti- 
cal foundations for querying databases based on bags. The expressive power and 
complexity of this language, which can be regarded as a rational reconstruction 
of SQL, and its relationships with both relational algebra and a nested relational 
algebra [3] have been investigated. 

The operations defined in this work are intended for flat bags (bags of tuples 
with attributes of basic types) as well as for nested bags (where tuple attribute 
values can also contain nested bags). 

Types and domains. A distinction between types and domains is made. [Ti, . . . , 
T„] is a tuple type, whose domain is the set of tuples over Ti,... ,T„, that 
is, Dom([Ti, . . . ,T„]) = Dom(Ti) x . . . x Dom(T„). {[T]} is a bag type, whose 
domain is the set of finite bags of objects of type T. 

Operations on records and tuples. The tupling function r transforms a record 
into a tuple. Formally, t(oi, . . . , Ok) = [oi, . . . , Ok], where oi, . . . , 0 k are k at- 
tributes of type Ti, . . . , Tfe, respectively, and [oi, . . . , o^] is a k-ary tuple, of type 
[Ti, . . . , Tfc], containing Oi (t = 1 . . . fc) in its i-th attribute. 

The attribute projection function ai returns the i-th attribute of a given tuple. 
Formally, ai([oi, . . . , 0 k]) = Oi. 



Membership test and multiplicity. An element n-belongs to a bag if it belongs to 
that bag and has exactly n occurrences. Function member of type T x {]T|} — > 
Boolean returns true on a pair (o, B) iff o p-belongs to B, p > 0. 
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In other words, in order to relate [26], [5], and Sect. 3.1, given a bag B and 
an element 6, 

1. h n-belongs to B b ££ B = n Occ{b, B) = n; 

2. member{b, B) = true b G B. 

Constants. {[•, ■ . . , •[} is a bag; constant {||} denotes the empty bag, even if also 
symbol 0 is used very often with the same meaning. 

Bagging (or bag singleton). Given an element o, by definition /3(o) = {joj} is a 
bag containing o as a single element, i.e. o 1-belongs to (3{o). 

Additive union. Bags are built by using the additive union operation l±l starting 
from the empty bag and singleton bags. That is, each bag is either {||}, or a 
singleton {jccj}, or the additive union of two bags B^B' . Given two bags B and 
B' of type {|T|}, by definition B^B' is a bag of type {|T|} such that o n-belongs 
to B l±l i?' iff o p-belongs to B and g-belongs to B' and n = p + q. 

This operator is the same as the concatenation operator U defined by [5], 
here extended also to nested bags. 

Extension. If / is a function of type T ^ {|7"^|}, the extension of / extends / 
to a function of type {|T|} — > {|T'|} as follows: 

EXTf{{\xi,... ,Xn\} = f{xi)^...^f{xn) . (112) 

MAPg is a concise form for EXT^og, where (3o g is the composition of function 
[3 (bagging) and function g, that is, (/3 o g){d) = (3{g{d)). 

Cartesian product. If B and B' are bags containing tuples of arity k and k' , 
respectively, by definition B x B' is a bag containing tuples of arity k + k' such 
that o = [oi, . . . , Ok, Ofc+i, . . . , Qk+k'] n-belongs to B x B' iff oi = [oi, ... ,Uk] 
p-belongs to B, 02 = [ofe+i, . . . , Ofc+fc'] g-belongs to B' and n = p ■ q. 

The Gartesian product operator is the same as the product for multisets 
defined in Sect. 3.1, which can be expressed in SQL by means of the select-from- 
where paradigm. Such an operator was naturally extended to nested multisets 
in Sect. 4.1. 

Subtraction. Given two bags B and B' of type {|T|}, by definition B — B' is 
a bag of type {|T|} such that o n-belongs to i? — iff o p-belongs to B and 
g-belongs to B' and n = max(0,p — q). 

This operator, called also difference [34], is the same as the difference oper- 
ator \ defined by [5] and outlined in Sect. 3.1. Such an operator was naturally 
extended to nested multisets in Sect. 4.1. 

Maximal union. Given two bags B and B' of type {|T|}, by definition B LI B' 
is a bag of type {|'T|} such that o n-belongs to B U i?' iff o p-belongs to B and 
g-belongs to B' and n = max(p, q) . 

This operator is the same as the union operator U defined by [5], here ex- 
tended also to nested bags. 
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Intersection. Given two bags B and B' of type {|T|}, by definition B f] B' is 
a bag of type {|T|} such that o n-belongs to B n iff o p-belongs to B and 
g-belongs to B' and n = min(p, g). 

This operator, also called minimum intersection [34], is the same as the in- 
tersection operator n defined by [5] and outlined in Sect. 3.1. Such an operator 
was naturally extended to nested multisets in Sect. 4.1. 

Duplicate elimination. Given a bag B of type {|T|}, e{B) is a bag of type {[Tj} 
containing exactly one occurrence of each object of B. Formally, an object o 
1-belongs to e{B) iff o p-belongs to B for some p > 0, and 0-belongs to e{B) 
otherwise. 

Function e, also called unique [34], becomes the identity function when it is 
turned from an operation on bags into its set analog. 

Equality test. Function eq of type T xT ^ Boolean returns true on a pair (o, o') 
iff o and o' are equal objects. 

Subbag test. Function subbag of type {|T|} x {|T|} ^ Boolean returns true on 
a pair {B, B') iff whenever o p-belongs to B, then o p'-belongs to B' for some 

p' > p. 

In other words, relating [26] and [5], subbag{B, B') = true ^ B C B' . 
However, here the subbag test is inherent also to nested bags. 

Complexity and expressive power. Gomplexity and expressive power are consid- 
ered with respect to the language defined by operators r, Oj, [3, l±l, EXT, x, 
together with the empty bag constant. From the complexity point of view, each 
of the subtraction, union, intersection, duplicate elimination, subbag, member- 
ship, and equality test operators has polynomial time complexity with respect 
to the size of the input [34] . From the expressive power point of view, the same 
operators are related as follows [26, 34]: 

1. — can express all primitives other than e; 

2. e is independent of the rest of the primitives; 

3. n is equivalent to subbag and can express both U and eq-, 

4. member and eq are interdefinable, both are independent of U, and each of 
them, together with U, can express H. 

Thus, the strongest combination of primitives among — , U, fl, e, subbag, member 
and eq, is — and e. 

Bag query languages. Operators r, ai, f3, l±l, EXT, x, — and e, together with 
the empty bag constant, are the primitives of BALG (standard bag algebra) [26], 
also referred to as BQL (bag query language) in [34]. 

BALG can express many operations commonly found in database languages. 
For instance, M APxx\a.2(x),(^30] denotes the projection of a tuple type (the type 
of the elements of the bag which is the operand) on its second and third argu- 
ments. Following [26], for the sake of brevity, the map projecting the attributes 
i\,. . . ,in will below be denoted by 
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Fig. 4. The directed graph of Example 39 



Likewise, MAPxx.ai{x)=a denotes the selection within a bag of tuples of all 
and only the tuples wherein the i-th argument equals a. A shorthand for this 
query is ai^a- 

Assumed to represent an integer i by means of a bag containing i occurrences 
of the same element (and nothing else), BALG allows the definition of several 
fundamental database primitives. For example, bags can be used to simulate 
SQL aggregate functions, such as SUM and COUNT. In fact, if B is a bag of tuples, 
COUNT(B) = 7Ti({|a|} x B). 

Example 38. A relation can be represented in BALG by means of a bag with no 
duplicates, such as R = {|7,5,3,9,4,2|}. The parity of the cardinality of a rela- 
tion becomes definable in BALG in the presence of an order on the domain. The 
following Boolean expression is true iff the parity of the cardinality of relation 
R is even: 



'^Aa;.(M4P[£,] (o->„,(„<a,)P) = (M4P[„] {(y\y.(x<y)R ,)(«)# {ID- (113) 

In fact, the inequality holds only if there exists an x such that the number of 
elements smaller than or equal to x equals the number of elements strictly bigger 
than X. For instance, the above query can check the parity of the sample relation 
R by exploiting the order of integers. Based on Table 29, it emerges that the 
result of the left-hand side is {|4|}, thus the query gives true. 

This example is quite simple as it involves a relation among integers. However, 
the same considerations hold also for relations among tuples, provided that there 
is at least one attribute whose domain is ordered. □ 

Example 39. Gonsider a directed graph whose edges are recorded in a binary 
relation G, such as the one displayed in Table 30 inherent to the graph of Fig. 4. 

Formally, in BALG a graph is an object of type {|t x t|}, where t is a base 
type with a countably infinite domain of uninterpreted constants. For instance, 
G is represented by the bag of tuples {|[r, a], [r, h], [o, d], [a, e], [a, /], [b, /], [b, g]]}. 

Note that the domain of a graph is unordered, since the domain of t is 
unordered, that is, only equality test is available for constants of type t. 

The following is a Boolean query on the graph, that is, a query defined on 
{|t X f|} ^ Boolean, 

(^2({T2=a(G)))-(^l(ai.,(G)))^{||} . 



(114) 
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Table 29. Parity of the cardinality of an ordered relation represented as a bag. 



X 


^Xy.(y<x) R 


^ iS^\y.{y<x)R') 


^Xy.(x<y)^ 




T 


{[7,5, 3,4,2]} 


{IH. H. H: [a]> HI} 


{|9|} 


{IHI} 


5 


{[5,3, 4,2]} 


{|[a],[a],H,[a]|} 


{|7,9|} 


{IH,HI} 


3 


{|3,2|} 


{IH,HI} 


{|7,5,9,4|} 


{|[a],[a],H,[a]|} 


9 


{[7,5, 3, 9, 4,2]} 


{[[a], H, [a], [a], [a], [a]]} 


{!!} 


{!!} 


4 


{|3,4,2|} 


{IH,H,HI} 


{|7,5,9|} 


{IH,H,HI} 


2 


{|2|} 


{IHI} 


{[7,5,3, 9,4]} 


{[[a], H, [a], [a], Hi} 



It gives true if! the in-degree of a node a is bigger than its out-degree. 

In our sample case, the following equalities hold: 7T2{t72=a{G)) = 7T2({|[r, a]|}) 
= {INI}. and 7ri((Ji=o(G)) = 7Ti({|[a, e], [a, /]}) = {|[a],[a]|}. Thus, Equa- 
tion (114) becomes {|[a]|} — {|[a], [a]|} yf {||}, which gives false. □ 

BALG vs. relational algebra: complexity. As stated in [4], relational algebra en- 
joys an AGO data complexity upper-bound and so does nested relational algebra 
[48]. AGO [24] offers potential for efficient parallel evaluation. However, BALG 
is not in AGO: its authors state that BALG is contained in LOGSPAGE. 

BALG vs. relational algebra: expressiveness. A comparison of the expressive 
power of BALG to that of relational algebra is provided in [26]. Since BALG 
considers complex (nested) objects as well as flat relations, the relationship be- 
tween BALG, when applied to complex objects, and nested relational algebra is 
also investigated. 

Let arithmetic include the type nat of natural numbers, together with the 
operations of addition, multiplication, modified subtraction — (i.e. n — m = 
max(0, n — m)), and summation Bf. Here / is of type T nat, and Bf of type 
{T} ^ nat with the semantics Bf{{xi , . . . , Xk\) = f(xi) -|- . . . -I- f{xk). 

A theorem states that BALG when restricted to flat bags is equivalent to 
relational algebra -I- arithmetic, and BALG over nested bags is equivalent to 
nested relational algebra -I- arithmetic. The comparison between BALG and 
nested relational algebra is restricted just to nested bags since it was shown 
[50] that nested relational algebra has no extra power with respect to relational 
algebra over flat relations in the sense that any nested relational algebra query 
from flat relations to flat relations can be defined in relational algebra. 

Basically, BALG is more powerful than relational algebra and nested rela- 
tional algebra since bags give a counting power. For instance, the result presented 
in Example 39 is quite important, as, while BALG can test the parity of rela- 
tions over ordered domains, while relational algebra cannot test parity on any 
(ordered or unordered) domain. 

However, some fundamental limitations on the expressive power of first-order 
logic, which are typical of relational algebra and nested relational algebra, still 
hold in BALG. For instance, consider the query bag-even, which tests the parity 
of the number of duplicates in a bag. Formally, given a bag B of type {|T|} and 
an element c belonging to Dom(T), bag-even(B,c) = true if c’s multiplicity in B 
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Table 30. The binary relation G representing the directed graph of Fig. 4. 



Arrow tail node 


Arrow head node 


r 


a 


r 


b 


a 


d 


a 


e 


a 


/ 


b 


/ 


b 


9 



is even, and false otherwise. A proposition, which is claimed to be valid for both 
flat and nested bags, is stated in [26], according to which the query hag-even 
is not expressible in BALG. More generally, it is stated that a property of the 
number of duplicates of a single constant can be tested in BALG only if such a 
property (or its complement) holds for a finite number of bags. 

Besides, a theorem states that, for any two graphs G and G', each of which 
consists of a number of disconnected simple cycles having the same length, there 
exists a k such that, if the lengths of the cycles of both graphs are longer than 
k, then, for any Boolean query q, q{G) = q(G'). This means that the two above 
defined graphs cannot be distinguished based on the answers to BALG queries. 
From this, [26] concludes that many recursive queries are not definable in BALG 
over unordered domains: parity of the cardinality of a relation, transitive closure, 
deterministic transitive closure, testing acyclicity, testing connectivity, testing 
for balanced binary trees. A proof of the undeflnability in BALG of parity test, 
transitive closure, and test for balanced binary trees is provided in [34]. 

Several attempts to extend the power of BALG to support the above un- 
deflnable queries have to be registered. In this respect, three further operators, 
introduced in [33] and listed below, have to be considered. 

Powerhag. Given a bag B, the powerbag operator returns the bag of all subbags 
of B. For instance, 

porcer6ag({|l,l,2|}) = {|{||},{|l|},{|l|},{|2|}, 

{|1,1|},{|1,2|},{|1,2|},{|1,1,2|}} . (115) 

Powerset. When applied to a bag B, the powerset operator returns the bag of 
all subbags of B, each with multiplicity 1. For instance, 

pozcerset({|l,l,2|}) = {|{||},{|l|},{|2|},{|l,l|},{|l,2|},{|l,l,2|}} . 

(116) 

This operator is the same as the homonymous operator defined in [5]; however, 
here it is extended to nested bags. 

Loop. Gonstruct loop takes as input a function / of type T ^ T, an object o of 
type T, and a bag B = {|oi, . . . ,o„|} of any type {]T'|}, and returns / applied 
n times in cascade to o (where n is the cardinality of B). 




On Multisets in Database Systems 211 



Power and complexity of extended BALG. It was shown in [33] that powerbag 
and powerset are inter definable. However, even if all the queries which are not de- 
finable in BALG over unordered domains become definable in BALG+ powerset, 
they are not definable efficiently. Out of efficiency BALG+powerbag has to be 
preferred. The complexity of BKLG+powerset and BKLG+powerbag was studied 
in [27]. 

The loop operator was introduced as an alternative to the structural recursion 
operator. The main problem with using structural recursion is that it requires 
some preconditions on its parameters for well-definedness. However, such pre- 
conditions are generally undecidable [9] . In [33] it was shown that nested BALG 
with structural recursion is equivalent to nested BALG with loop. 



7 Conclusion 

This paper has shown how the notion of a multiset affects the data model of 
database systems and the relevant query languages. Since most current database 
systems are relational, the focus is on the relational data model and on relational 
algebra. 

The relational data model is based on the central concept of a relation. 
Intuitively, a relation is a set of tuples, where each tuple is, in turn, a collection 
of atomic values, one for each attribute of the relation. A database consists of one 
or several relations. Once a relational database has been created, it is possible 
to retrieve and modify the stored information by means of appropriate query 
languages. Among formal query languages for relational databases is relational 
algebra, which deals with the manipulation of sets. This means that each query 
result is expected not to include duplicates. 

Even if, from the theoretical point of view of relational algebra, the result of 
a query is a set, from the practical point of view it is sometimes convenient to 
have multisets as results of queries, in particular if the information they include is 
intended to be processed by aggregate functions. Thus, practical query languages 
for relational databases, typically based on the standard query language SQL, 
allow the manipulation of relations in a multiset-oriented way. Based on the 
above considerations, relational algebra operators have been straightforwardly 
extended in order to manage multisets, thus obtaining some relational algebras 
for multisets. 

Historically, the awareness has gradually grown that the relational data model 
is too simple to express in an intuitive way complex data, as those typically 
encountered in non business applications, and relationships among such data. 
Thus, several extensions to the relational model have been proposed as well 
as a number of extended relational algebras. The most noteworthy extension 
of the relational model, called nested relational model, was the transformation 
of usual (fiat) relations into nested relations, where each attribute value is not 
necessarily atomic, rather, it can be a (nested) relation itself. The algebraic 
counterpart of the nested relational model is a nested relational algebra, which. 




212 Gianfranco Lamperti, Michele Melchiori, and Marina Zanella 



although manipulating only sets, both extends relational algebra operators to 
nested relations and introduces new operators. 

Roughly, two research streams have been focused on the extension of re- 
lational algebra. On the one hand, flat relations have been extended to flat 
multisets. On the other, flat relations have been extended to nested relations. 
The two approaches have been integrated within the notion of a complex rela- 
tion, that is, a multiset of tuples where each attribute is possibly a (complex) 
relation. A nested relational algebra for complex relations has been defined by 
combining the concepts inherent to the relational algebra for multisets and the 
nested relational algebra. 

Moreover, the paper hints at some research topics inherent to multisets in 
database systems. In the last decade, there has been some activity in trying to 
identify a standard formal query language for multisets, which may play a role 
similar to that of relational algebra. The first requirement of such a language 
is to support aggregate functions, and the second is to deal with nested data. 
Processing and optimization of queries in the proposed languages is an active 
research area. However, investigating the expressive power of multiset languages 
involves many difficulties. In fact, in order to cope with aggregate functions, 
multiset languages support built-in arithmetic and, therefore, it is hard to find 
a logic that captures them and that can be exploited for proving results about 
the expressive power. As to nested multiset languages, they involve calculi that 
are essentially higher-order logics, whose expressibility properties are rather un- 
known. 
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Abstract. A multiset involves an equivalence relation between the copies 
of the same element. However, in many cases the binary relation relating 
an element to its copies is not exactly an equivalence one, but a weaker 
relation, in most cases a tolerance (i.e., reflexive and symmetric) relation, 
leading to a natural extension of multisets: tolerance multisets. 



1 Behind Multisets Is a Relation of Sameness 

Let us consider various versions of the definition of a multiset, [2,7,8,9,18,19,20]: 

(1) a set in which every element occurs a finite number of times; (2) a set in 
which every element exists in a finite number of copies; (s) given a set D and 
the set N of non-negative integers, a multiset over H is a function / from D 
into N, or (4) a pair {D, /), where / is a function from D into N; (5) a multiset 
over D corresponds to a (finite) string over D. For the way the multisets occur 
in membrane computing, see [16]. 

In all these versions, more or less rigorous, of the definition of a multiset, a 
common, unaivodable feature is to assume the existence of a binary relation of 
sameness between some objects. In version (1), the expression “finite number of 
times” suggests that we can order in time the objects we are considering and we 
can realize that some different occurrences refer to the same object. In version 

(2) , the idea that an object is a copy of another one relates to the same relation of 
sameness, although the word “copy” may suggest that it refers to another object, 
which is the original one, to which the copy is in a relation of dependency. In 
version (3), we recognize the extension of the idea of a characteristic function, 
used to define a set; but saying that, for the object x in D, f{x) is equal to 
n, where n is larger than one, means to assume the same relation of sameness 
between n different occurrences of x. In version (4), the situation is similar. In 
version (5), it is obviously assumed that there is a sharp distinction between 
the different elements of the alphabet D (as a matter of fact, this is the basic 
assumption for any alphabet) and that the same element of D may occur in a 
string several times, with no upper bound other than the length of the string. 

Multisets were called sometimes hags [20]; this last term is both shorter and 
more convenient, because “multisets” may be associated with another term, used 
in analysis and topology, “multifunctions” , denoting functions whose values are 
sets. Today however is too late to change the terminology in this respect, because 
“multiset” is quasi-generally accepted. 
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Version (5) considered above calls attention to the fact that multisets are 
only one among several types of data-structures we met in various information 
sciences. The terms of reference are order and repetitions. In a given arbitrary 
set, neither order nor repetition of the elements are relevant. In a list, both 
order and repetition are relevant; strings are lists. Multisets are less restrictive 
than lists, because only repetitions are relevant, but more restrictive than sets, 
which are completely free in respect to both order and repetition. It remains the 
possible case when only order is relevant: it is what happens with compact lists, 
which are lists in which contiguous occurrences of the same object are reduced 
to only one occurrence [2,18]. The extension we are proposing with respect to 
multisets can be considered for lists and compact lists too. 

2 Prom Multisets to Tolerance Multisets: 

The Case of Synonymy 

Examples of multisets that are not concerned by our extension are mainly those 
occurring in pure mathematics: the divisors of a positive integer p form a multiset 
where there is no reason for an extension to a tolerance multiset. The role of the 
set D belongs to the set of prime divisors of p. So, in the set of divisors of 72 we 
detect three occurrences of 2 and two occurrences of 3. Similarly, the roots of an 
algebraic equation such as — x + \ form a multiset in which 1 has the 

multiplicity equal to 2, while —1 has the multiplicity equal to 1. Here too, there 
is no reason for an extension to tolerance multisets. In both these examples, the 
relation between an element and one of its copies is clearly reflexive, symmetric, 
and transitive. 

The situation changes as soon as we move to multisets conceived as models of 
some natural or social phenomena. Coins of various values in U.S.A. form a mul- 
tiset and for every value we have a large number of copies. The relation between 
two coins of the same value is, generally speaking, an equivalence relation, but, 
if a coin of one dollar is too deteriorated, it may happen that some machines will 
not accept it. There is no sharp border between an accepted and a non-accepted 
coin, so what seemed to be, at a first glance, an equivalence relation is only a 
tolerance one. 

Another significant example is the relation of synonymy in linguistics. Roughly 
speaking, two words or expressions are synonymous if they have the same mean- 
ing. If we remain at this representation of synonymy, then the respective relation 
is clearly an equivalence and we can conceive the vocabulary of a language as 
a multiset in which the number of copies of a word is equal to the number of 
its synonyms. However, things are not so simple. Exact synonyms rarely (if not 
never) exist. This means that it is very difficult (if not impossible) to find words 
X and y such that for any context {u, v) for which uxv is meaningful, uyv is 
meaningful too, and they both have the same meaning, while the converse is 
also true: for any context {u, v) for which uyv is meaningful, uxv is meaningful 
too and they both have the same meaning. So, we weaken the idea of synonymy 
in the following way: x and y are in a relation of partial synonymy if there ex- 
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ists at least one context {u, v) such that uxv and uyv have the same meaning. 
But partial synonymy is no longer transitive, so it is no longer an equivalence 
relation, it is only a tolerance relation. Examples in English can be obtained by 
means of any monolingual English dictionary. Using [3], we find that ‘big’ is a 
partial synonym of ‘large’, of ‘important’ and of ‘boastful’; ‘large’ is a partial 
synonym of ‘great’, which, in its turn, is a partial synonym of ‘elaborate’ and of 
‘intense’. If we remain to what the dictionary tells us, we have to consider that, 
in respect to [3], ‘big’ and ‘great’ are not partial synonyms, although this fact 
cound seem strange. Any analysis of this type should be done in respect to a 
given dictionary, otherwise room is left to arbitrariness and subjectivity. 

In French, taking as term of reference the dictionary [4], we find that ‘grand’ 
is a partial synonym of ‘important’, ‘considerable’, ‘vaste’, while ‘important’ 
and ‘considerable’ are partial synonyms of ‘large’, which, in its turn, is a par- 
tial synonym of ‘important’ and of ‘considerable’, both of them being, as we 
have seen above, partial synonyms of ‘grand’. It follows that ‘grand’ and ‘large’ 
become partial synonyms only by mediation of ‘important’ and ‘considerable’. 
This mediated synonymy could suggest to define a relation of generalized syn- 
onymy by considering the reflexive and transitive closure of the relation of partial 
synonymy. In this way we get again an equivalence relation. 

But perhaps the most familiar example of a multiset is the bag in which, 
at the market, we introduce a certain number of apples, a certain number of 
potatoes, a certain number of oranges, etc. Beyond their common trait, to be, 
for instance, apples, the respective objects may be not at all similar, but there 
is no danger to confuse an apple with a potatoe or with an orange. Just this fact 
shows that in this case the extension from multisets to tolerance multisets is not 
necessary. 

3 A Tolerance Multiset in Phonology 

A more sophisticated situation appears in phonology. The basic concept of a 
phoneme is, in a first approximation, a class of equivalent sounds. Accepting 
this representation, the set of phonemes is a multiset of sounds; the phonemes 
are the types, while the sounds are the token. The only difficulty here is the 
problematic finiteness of the set of sounds associated with a given phoneme; and 
even in the case in which this set is really finite, its cardinal number cannot be 
practically evaluated. As a matter of fact, the situation here is more sophisticated 
and we will try in the following to describe what happens (see also [10] and [11]). 

Let us consider a set S of elements called abstract sounds and let F{S) be 
the free-monoid generated by S. Let L{S) be a subset of E(S'); L{S) is said to 
be a language over S. The elements of L{S) are well-formed strings over S. Let 
r be an equivalence relation in F{S). If for x,y,z in F(S) and x r y we have 
zx r zy and xz r yz, then r is said to be a congruence relation in F{S). Let 
R be a congruence relation in F{S), such that L{S) is closed with respect to R, 
i.e., if X is in L{S) and if x B, y, then y is in L{S). The triad (F(S'), L(S'), R) 
is said to be a phonological system; it is a phonological system of the language 
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L{S). The choice of L{S) and of the relation R has both syntactic and semantic 
motivations. 

Given the subsets X{1), X{2), . . . , X{n) of F{S), we denote by X(1)X(2) 

. . .X{n) the set of all strings of the form x{l)x{2) . . . x{n), where x{i) G X{i) 
for all 1 < z < n. 

Proposition 1. The free monoid D{X) generated by X is identical to the union 
of all sets X”, n > 0, where X” is the cartesian product of X with itself n times. 

Strings x and y are said to be in a relation of variation, i.e., x is a variant 
of y (or y is a variant of x), and we write x m y ii for any strings z and w the 
following implications are valid: (i) if zxw is in L{S), then zxw R zyw; (ii) if 
zyw is in L{S), then zxw R zyw. 

Proposition 2. If x m y and if x is in L{S) or y is in L{S), then x R y. 

Proposition 3. The relation of variation is a congruence relation in F{S). 

Proposition 4. The set L{S) is closed with respect to m. 

The relation m of variation corresponds to the free variation, well known 
from phonemics. It is known that two sonorous strings are considered as variants 
(allophones) of the same entity not only when they are in free variation, but also 
when they are in some situations of complementarity distribution. In order to 
include this possibility too, we adopt an extension k of the relation m: strings x 
and y are in relation k, we write xRy, and we say that k is a relation of variation 
in the broad sense of m if for any strings z and w such that zxw is in L(S) and 
zyw is in L{S), we have also zxw R zyw. We say in this case that x (resp. y) is 
a variant in the broad sense of y (resp. x) and we write x v y. 

We define now an equivalence relation s in F{S): if x and y are in F{S), we 
have X s y if and only if for two arbitrary strings z, w in F(S) either zxw and 
zyw are both in L{S) or zxw and zyw are both absent from L{S). 

Proposition 5. If we have u s v and x s y {u,v,x and y being strings in F{S)), 
then we also have ux s vy. 

Proposition 6. The relation s is a congruence relation in L{S). 

A string x is semi-wellformed with respect to L(S), if it is not wellformed, 
but there exist two strings y, z such that the string yxz is wellformed, i.e., yxz is 
in L{S). A string which is neither wellformed not semi-wellformed with respect 
to L{S) is said to be parasitic with respect to L{S). 

Proposition 7. The parasitic strings with respect to L{S) form a single class of 
s- equivalence. 

Proposition 8. If x is semi-wellformed with respect to L{S) and if y is in a 
relation of s- equivalence to x, then y too is semi-wellformed with respect to L{S). 

Proposition 9. If x is a variant of y, then x is a variant in the broad sense of 
y; the converse is not true. 
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Sometimes, the phoneme is defined as a set of sounds which are either in 
free variation or in complementary distribution; in orther words, strings x and y 
belong to the same phoneme ii x v y (because we never have both zxy and zyw 
in L{S), X and y are accepted in L{S) by no common context), i.e., x (resp. y) is 
a variant in the broad sense of y (resp. x). But, as the following theorem shows, 
the relation v is not an equivalence relation, so the considered definition of a 
phoneme is ambiguous: the same sound may belong to two different phonemes. 

Theorem. The relation of variation in the broad sense is not transitive. 

Proof. Let S = {a, b, c, d, e, /} and L{S) = {ae, ce, fa, cf, dbe, dae, dee, fae}. 
Let us define R as the relation of decomposition of J^(S') in classes of s-equi valence. 
We will show that for x = a,y = b, and u = c we have x v y and y v u, but 
we do not have x v u. Indeed, we have zaw in L{S) and zbw in L{S) iff z = d 
and w = e. On the other hand, we have dbe R dae, because dbe and dae are 
wellformed strings whose only acceptance context is the empty one. We have 
proved in this way that a: v y. It is also true that y v u, because zbv and zew 
are in L{S) iff z = d and w = e and, on the other hand, we have dbe R dee. It 
remains to observe that we do not have x v u, because if z is the empty string 
and if w = e, then the strings zxw and zuw are in L{S), but we do not have 
zxw R zuw; indeed, zxw is accepted by the context (/, A), where A is the empty 
string, while zuw is not accepted by this context. □ 

The above theorem clearly shows that the set of (abstract) sounds is not 
a multiset, but a tolerance multiset; the phonemes are the types, while the 
(abstract) sounds are the token. But the type-token relation is no longer an 
equivalence relation, it is a tolerance relation. For more details in this respect, see 
[5] (for free variation), [6] (for a first logical model of the phoneme), [10,11] (for 
the proofs of the above propositions), and [12] (for the definition of synonymy). 

4 The Topological Status of the Tolerance Multisets 

Let (^, t) be a tolerance space, i.e., a set X endowed with a tolerance relation 
t. Let 4. be a subset of X. Following section 4 of [14], we consider for any x in 
A its tolerance class t(x), i.e., the set of all objects y in X for which x t y, Let 
T*{A) be the set of all these classes, from which we select the subset T^.{A) of 
those classes t(a;) which are contained in A. The ordered pair (T*(4), T*(4)) 
will be called a tolerance multiset. In [14], it was called a tolerance rough set. 
Tsf{A) and T*(A) are the lower and the upper tolerance approximations of A by 
t. The set T*{A) — T^{A) is a rough tolerance description, in the sense of [14], 
of the boundary of A. In the particular case when the tolerance relation t is an 
equivalence relation, we get the case of usual rough sets. The rough perspective 
in the study of data was initiated by Z. Pawlak [15]. 

One can extend this approach, in order to include tolerance classes which are 
not in T*{A). These classes are not equally situated with respect to A. Their 
hierarchy can be obtained by starting to put T^{A) = T*{A). Let T^{A) be 
the set of those tolerance classes whose intersection with at least one tolerance 
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class in T^{A) is non-empty. Assuming T"{A) already defined, we consider the 
set T"+^(A) of those tolerance classes whose intersection with at least one class 
in T"(A) is non-empty. One obtains a sequence T°(A), T^(A), . . . , T"(A), . . . , 
where every term is contained in the next one. A similar approach can be adopted 
for the set To(A) = T*(A) and we get an increasing sequence To{A), Ti{A), . . 
Tn{A), . . ., where T„(A) is the set of those tolerance classes which intersect at 
least one class in T„_i(A). We got in this way two infinite sequences; the former 
is aimed at refining our understanding of the boundary region T*(A) — Tt,(A), 
while the latter helps us to a better understanding of what happens beyond 
T*{A). Obviously, if the set X is finite, then each of the considered sequences 
will become stationary after a finite number of steps. 

The behavior of the synonymy operator, considered in a previous section, 
illustrates and explains, on a concrete situation, the approximation process just 
described. 

From tolerance multisets there is only one step to Cech topology [1]. Let us 
first recall what happens with the topology t(R) generated by R, when R is an 
equivalence relation. In this case, the approximation R*(A) of the set A is just 
the interior of A, while the upper approximation R* (A) of A is the closure of A 
in the topology t(R). This topology is total, i.e., any intersection of open sets is 
open and any union of closed sets is closed (as a consequence of the fact that 
in t(R) a set is open iff it is closed). Now let us consider the case when t is a 
tolerance relation and let us recall the concept of Cech topology [1] . It is defined 
in a set X by means of a generalized closure operator cl which associates to any 
subset S' of A a set cl{S) contained in X, such that c?(0) = 0, S C d(S), and, 
if A C _B, then cl{A) C cl{B). In the particular case in which cl{S) = cl{cl{S)) 
for any subset S of A, we get the usual closure operator associated with the 
usual topology. The ordered pair (A, cl) is said to be a Cech topological space. 
Any tolerance relation t in A generates a Cech topological space, by defining 
cl{A) = {x I there exists y in A such that x t y}. The converse is not true; 
indeed, it is easy to see that the closure operator clt generated by the tolerance 
t is additive, i.e., dt(AU B) = clt{A) U clt{B), while the Cech closure operator 
does not always have this property. 

Tolerance multisets can be presented in the language of Cech topology. The 
sequence (T„(A))„ is obtained by applying the Cech closure operator clt gen- 
erated by the tolerance t successively to T'*(A), while the sequence T”(A))„ is 
obtained in a similar way, but starting with T*(A), which in turn is obtained by 
applying the operator clt to A. There exists a natural relation between tolerance 
relations and information systems. Let us consider the correspondence / asso- 
ciating with each object attribute values; / determines in the set of objects a 
tolerance relation t: the objects x and y are in the relation t if they have at least 
one attribute value in common. Conversely, one can prove that any tolerance re- 
lation can be obtained in this way. For more about tolerance in linguistics, see 
[17]. For a relation between tolerance and learning processes, see [13]. 
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Abstract. Fuzzy multisets with infinite membership sequences and their 
generalization using set-valued memberships are considered. Two metric 
spaces of the infinite fuzzy multisets are defined in terms of cardinality. 
One is the completion of fuzzy multisets of finite membership sequences; 
the other is derived from operations among fuzzy multisets and real- 
valued multisets. Theoretical properties of these infinite fuzzy multisets 
are discussed. 



1 Introduction 

Recently, many studies discuss multisets and their applications such as relational 
databases, information retrieval, and new computing paradigms. Multisets have 
sometimes been called bags. While the well-known book by Knuth [5] uses the 
term of multisets, another book by Manna and Waldinger [7] devotes a chapter 
to bags. The terms of multiset and bag can thus be used interchageably. The 
author prefers to use multisets but some readers may interpret them to be bags 
instead. 

This paper is concerned with a generalization of multisets, that is, fuzzy mul- 
tisets [16,3,14,4,13,15,8,9,10,6]. The ordinary nonfuzzy multisets are called crisp 
multisets by the common usage in fuzzy systems theory. We discuss theoreti- 
cal aspects of fuzzy multisets; in particular, the focus is on infinite features of 
multisets. Namely, infinite sequences of memberships and a distance between 
fuzzy multisets are discussed. Further generalizations of fuzzy multisets are de- 
fined using a closed set in a plane as the membership range. All arguments are 
elementary, and hence we omit proofs of the propositions. 

2 Finite Multisets and Fuzzy Multisets 

Before considering fuzzy multisets, a brief review of crisp multisets is useful. In 
the preliminary consideration we assume finite sets and multisets for simplicity. 

2.1 Crisp Multisets 

Let us begin by a simple example. 

Example 1. Assume that X = {x,y,z,w} is a set of symbols and suppose we 
have a number of objects but they are not distinguishable except their labels of 
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X, y, z, or w. For example, we have two balls with the label x and one ball with y, 
three with label z, but no ball with the label w. Moreover we are not allowed to 
put additional labels to distinguish two x’s. Therefore a natural representation 
of the situation is that we have a collection 

{x,x,y,z,z,z}. 

We can also write 

{2/a;, l/y,3/z,0/'u;} 

to show the number for each element of the universe X, or {2/a;, 1/j/, 3/z} by 
ignoring zero of w. We will say that there are three occurrences of x, two occur- 
rences of y, and so on. 

We proceed to general definitions. Assume X = {x\, . . . ,Xn} is a finite set 
called the universe or the basis set. A crisp multiset M of A is characterized by 
the function Count m{') whereby a natural number (including zero) corresponds 
to each x G X, that is [1,5,7]: 

Count M : A ^ {0, 1, 2, . . . }. 

For a crisp multiset, different expressions such as 
M = {ki/xi,...,kn/Xn} 



M = {xi,...,Xi,...,Xn,-- -,Xn} 

are used. An element of A may thus appear more than once in a multiset. In the 
above example xi appears k\ times in M, hence we have ki occurrences of x\. 

Consider the first example: {2/x, 1/y, 3/z}. We have 
CountM(x) = 2, CountM{y) = 1, CountM{z) = 3, CountM^w) = 0. 



The following are basic relations and operations for crisp multisets, 
(inclusion) : 

M C N Count m{x) < Count n{x), Vx S A. 

(equality): 



(union) : 



M = N ^ CountM{x) = Countisr{x), Vx G X. 



(intersection) : 



Count mun(x) = max.{CountM{x), Count m{x)} 
= Count m{x) V Count n{x). 



CountMnN(x) = min{CountM{x), Counti\[{x)} 
= Count m{x) a Count n(x). 



( 1 ) 

(2) 

(3) 

(4) 
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(addition): 
(subtraction) : 



CountM®N{x) = CountM{x) + CountN{x). 



CountMQN{x) = 0 A {CountM{x) — CountN{x)). 



( 5 ) 

(6) 



The symbols V and A are the infix notations of max and min, respectively. 
Readers should note that the operations resemble those for fuzzy sets, but the 
upper bound for Count{-) is not assumed. 

Example 2. Consider the multiset M in Example 1 and 

N = {l/a;,4/y,3/w}. 



Then, 



M (B N = {3/x,5/y, 3/z, 3/w}, 
MUN = {2/x,4:/y,3/z,3/w}, 
MnN = {l/x,l/y}, 

M Q N = {l/x,3/z}. 



Real-valued multisets 

Blizard [2] generalized multisets to real-valued multisets, from which we remark 
nonnegative real- valued multiplicity. Thus, the Count function takes nonnegative 
real- values: {Count : X — > U {0}). The basic relations and set operations use 

(l)-(6). 

2.2 Fuzzy Multisets 

Yager [16] first discussed fuzzy multisets, although he uses the term of fuzzy bag; 
an element of X may occur more than once with possibly the same or different 
membership values. 

Example 3. Consider a fuzzy multiset 

R = {(x, 0.2), (x, 0.3), {y, 1), {y, 0.5), {y, 0.5)} 

of Y = {x,y,z,w}, which means that x with the membership 0.2, x with 0.3, 
y with the membership 0.5, and two y’s with 0.5 are contained in A. We may 
write 

R = {{0.2,0.3}/x,{l,0.5,0.5|M 

in which the multisets of membership {0.2, 0.3} and (1,0.5, 0.5} correspond to 
X and y, respectively. 

Count a{x) is thus a finite multiset of the unit interval [16]. The collection of 
all finite fuzzy multisets of X is denoted by TXi{X). 
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For X G X, the membership sequence is defined to be the decreasingly-ordered 
sequence of the elements in Count a(x). It is denoted by 

where ^i\{x) > ti\{x) > • • • > iJ^ix). 

When we define an operation between two fuzzy multisets, say A and B, 
the lengths of the membership sequences ii\{x), ^\{x), . . . , /x^(x), and p}g{x), 
fj,g{x) , . . . , (x), should be set to be equal. We hence define the length L{x; A), 

the length of 

L{x;A) = max{j : yf 0 }, 

and define 

L{x] A, B) = max{L(x; A), L{x] B)}. 

When no ambiguity arises, we write L(x) = L{x; A, B) for simplicity. 

Example 4. 

Let 

A={{0.2,0.3}/a:,{l,0.5,0.5}M, 

B = {{0.6}/a;, {0.8, 0.6}/y, {0.1, 0.7}/tc}. 

For the representation of the membership sequence, we get 

L{x) = 2, L{y) = 3, L{z) = 0, L{w) = 2 

and we have 

^ = {(0.3, 0.2)/x, (1, 0.5, 0.5)/y, (0, 0)/u;}, 

B = {(0.6, 0)/x, (0.8, 0.6, 0)/y, (0.7, 0.1)M. 

The following are basic relations and operations for fuzzy multisets [8]: 

1. [inclusion] 

AC B y^^{x) < y^g{x), j = 1, . . . ,L{x), WxGX. (7) 

2. [equality] 

A = B y^^{x) = y^g{x), j = 1,. . . ,L{x), Vx G X. (8) 

3. [addition] 

A(BB is defined by the addition operation in X x [0, 1] for crisp multisets [16]. 
Namely, if 

A = {(xi,Hi), (xk,y,k)} 

and 

B = {(Xp, /ip), • . • , ( 3 ;^, /ir)} 
are two fuzzy multisets, then 

A (£) B — {(Xj, /ii), . . • , ( 3 ^/cj • ■ • j /ir)}- 



(9) 
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4. [union] 

Maub(^) j = (10) 

5. [intersection] 

AAis(a;), j = l,...,L(a:). (11) 

6. [a-cut] 

The a-cut {a G (0, 1]) for a fuzzy multiset A, denoted by [A]^, is a crisp 
multiset defined as follows: 

/i^(a;) < a Count[A]^{x) = 0, (12) 

> a, < a Count^A] (x) = j, (13) 

j = l,...,L(a;). (14) 

Moreover the strong a-cut (a G [0,1)), denoted ]A[^, is a crisp multiset 
defined as follows: 

I^a{x) < a Count-^A[^{x) = 0, (15) 

< a Count^^A[ {x) = j, (16) 

j = l,...,L(x). “ (17) 

7. [Cartesian product] 

Given two fuzzy multisets A = {{x,y)} and B = {(j/, jz)}, the Cartesian 
product is defined by: 

Ax B {{x,y,^l^v)] (18) 

The combination is taken for all (a;, /r) in A and {y, v) in B. 

8. [Multirelation] 

Notice that a crisp relation i? on X is a subset of X x X. Given a fuzzy 
multiset ^ of X, a multirelation TZ obtained from i? is a subset of ^ x A: 
for all {x, fj,), {y, v) G A, 

{x,y,y,Ai^)GTl {x,y) G R 

When i? is a fuzzy relation on X, then 

{x,y, y A V A R{x,y)) G TZ. (19) 

(The latter includes the former as a special case.) 

The following propositions are valid. The proofs are not difficult and hence 
omitted. 

Proposition 1. Assume that A and B are fuzzy multisets of X. The necessary 
and sufficient condition for A C B is that for all a G (0,1], [A]„ C [B]^. 
Moreover, the condition for A = B is that for all a G (0, 1], [A]^ = [B]^. 

Proposition 2. Assume that A and B are fuzzy multisets of X. The necessary 
and sufficient condition for A C B is that for all a G [0,1), ]AL C ]B[^. 
Moreover, the condition for A = B is that for all a G [0, 1), ]A[^ = ]B[^. 
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Proposition 3. Assume that A and B are fuzzy multisets of X. Take an 
arbitrary a € (0, 1). We then have: 



]^Ui?L = ]A[„U]BL, 
]A®B[^ = ]A[^®]B[^, 



[^ns]„ = [A]„n[i?]„, 

X X [B]„ 

MnsL = ]ALn]i?L, 
]^xBL = ]ALx]i?L 



Proposition 4. Assume that A, B, and C are fuzzy multisets of X. The 
followings equalities are valid: 



A\J B = B\J A, 

AC^B = BC^A, 

A U (B U C) = (A U B) U C, 

A n (B n C) = (A n B) n c, 

(A n B) U C = (A U C) n (B U C), 
(A u B) n c = (A n c) u (B n c). 



Therefore, the class of all fuzzy multisets of a particular universe forms a dis- 
tributive lattice. 



The cardinality and a metric space of fuzzy multisets 

The cardinality of a fuzzy multiset A is given by 

L{x\A) 

xGX 

Moreover, we define 

L{x;A) 

\A\x = 

i=i 

We thus have |A| = J2x(^x \A- 

We can define a metric space of fuzzy multisets. Namely, given two multisets 
A,Bg iFA4{X), the distance between A and B is defined by 

L(x\A,B) 

d{A,B)=Y^ \n\{x) - (20) 

xGX j=l 

We can immediately have 

d(A, B) > 0; d(A, S) = 0 ^ A = B, 
d{A,B) = d{B,A), 
d{A,C) < d{A,B) + D{B,C), 



hence the set of fuzzy multisets becomes a metric space. 



(21) 

(22) 

(23) 
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3 Infinite Features of Fuzzy Multisets 

So far all discussions are concerned with finite multisets. It is possible to intro- 
duce the infinity {-hoo} into the consideration of crisp multisets: Count m{x):X^ 
{0, 1, . . . , oo}. However, for the purpose of simple and effective calculations, we 
consider CountM{x) as being always finite. This assumption of finite crisp mul- 
tisets is useful for focusing the infinite features of fuzzy multisets. 

3.1 Infinite Membership Sequences 

Infinite fuzzy multisets means that for x G X, Count a(x) may be an infinite set 
of the unit interval I = [0, 1]. We note that a-cuts of a fuzzy multiset A gives 
well-defined crisp multisets [A]a and ]A[a for a G (0, 1). Since we assume a crisp 
multiset of finite cardinality {CountM{x) < oo), an infinite subset of I does not 
necessarily provide a well-defined crisp multiset. 

Hence, instead of an infinite set, we use a sequence , . . . ) as the Count 

function. Namely we assume 

Count a{x) = ,v'^ ,.. .) (24) 

where ^ 0 as j ^ 0. 

Basic relations and operations for infinite fuzzy multisets should also be de- 
fined in terms of membership sequences using (7)-(19) with the associated se- 
quences being infinite (e.g., L(x) oo). We assume that, regarding fuzzy mul- 
tisets, all sorting of the infinite membership sequence in (24) can be performed 
by an effective computational procedure. 

The class of all infinite fuzzy multisets of X is denoted here by XA4q(X). 
We immediately have the next proposition. 

Proposition 5. For A, B G tFAiQ(X), 

(i) A® Bg XMo(X), 

(ii) AuBg XMo(X), 

(iii) AnB G XMo(X). 

3.2 The Metric Space of Infinite Fuzzy Multisets 

We consider the subset of tFA4o(X) in which the cardinality is finite: 

OO 

|H| = ^ ^ f^^^(x) < oo. (25) 

xex j=i 

This subset is denoted by tFA4i(X): 

XMi(X) = { |H| < oo : H G XMo{X) }. 

For A,Bg XA4i{X), the distance is naturally defined by 

OO 

d{A,B) \dAi^) - 

xGX j^l 



(26) 
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The axioms of a metric, (21)-(23), are satisfied for d{A,B). 

The next proposition is similar to Proposition 5. 

Proposition 6. For A,Bg 

(i) A®Be BMi{X), 

(ii) AuBg BMi(X), 

(iii) AdB G BMi{X). 

The space !FA4i{X) of infinite fuzzy multisets is justified as follows. Sup- 
pose we have an infinite Cauchy sequence of finite fuzzy multisets Ai, A 2 , ■ ■ • G 
XM(X): 

d{Ai,Aj) — > 0, as i,j 00 . 

Then, the sequence converges to a fuzzy multiset A G BA4i(X). On the other 
hand, for an arbitrary A G BMi{X), there exists a sequence of finite fuzzy 
multisets ZI 2 , • • • G TM.{X) such that A' ^ A as j ^ 00 . 

The metric space of infinite fuzzy multisets is complete, whereas that of finite 
fuzzy multisets is not. 



4 A Set- Valued Membership: 

A Generalization of Fuzzy Multisets 



The membership sequence can be identified with a real- valued step function with 
the nonnegative real variable y G U{0} 



fA(y; x) 



(y = 0). 



A crisp multiset M is represented by a simple function of 



(27) 



fM{y,x) = 1, 0<y< Count m{x), (28) 

which is a particular case of Equation (27) by putting = 1 {j = 1, , 

CountM{x)). Notice that the last function, (28), is naturally extended to the 
case of positive real-valued multisets using the same definition (28) except that 
CountM{x) is real-valued. 

We thus assume that, instead of the membership sequences, fuzzy multisets 
and positive-real valued multisets are characterized by these functions. Let us 
consider these functions in a unified framework, namely, we allow operations 
between fA{y,x) by (27) and fM{y,x) by (28). 

The operations of union and intersection are immediately defined by using 
fA{y, x) V fM{y, x) and /a(j/; x) a /m(j/; x), respectively. On the other hand, the 
addition, which is characterisctic to the multisets, cannot directly be defined 
from these functions. 

For the operation of the addition, we use a closed set ua{x) in the (y, z)-plane 
naturally derived from fA{y,x). Namely, we define 

^a{x) = {{y,z) G : 0<y < supp/a(-;x), 0 < 2 < fA{y,x) }, (29) 
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where supp /a(s a;) = { y : fA{y',x)^0}. We moreover define a function gA{'', x) 
of the variable z: 

gA{z]x) =SMY>{y : {y,z) &va{x)]. (30) 

We can now define the addition operation by using 

gA(BM{z]x) = gA{z]x) + gM{z]x), (31) 



whereby the set vacbb{x) is defined by 



'^A(bb{x) = {{y,z) € B? ■. 0 < z < suppy^®M(-;a;), 

Q <y < gA®B{z]x)]. (32) 



We now proceed to generalize the class of functions fA{y, x). This generaliza- 
tion is necessary when we consider fuzzy multisets and real-valued multisets in 
the same framework. We easily note that the arbitrary monotone non-increasing 
step functions are included in this set since they are derived from operations 
between fuzzy multisets and real-valued multisets. 

Notice also that the cardinality is extended to the norm of the function 
fA{y, x); thus we define 



\fA{-;x)\ 




fA(y; x)dy, 



ii/a|| = Ei/^(-^)I- 



(33) 

(34) 



An appropriate class of /a is the set of monotone nonincreasing and upper- 
semicontinuos functions whose norms are finite. This class is denoted by 

GTMi{X) = {fA : II/aII <oo}. 



This class is justified from the fact that for an arbitrary / G Q!F Aii{X), we can 
take a sequence of monotone non-increasing step functions that converges to /. 
This class is moreover the completion of the class of all monotone nonincreasing 
step functions whose cardinality is finite. 

By abuse of terminology we write A G GXA4i(X). Since Ja and the associ- 
ated va{x) is interchangeable, we are considering a generalized fuzzy multiset A 
characterized by the closed set va{x) for each x G X. 

Operations on the generalized fuzzy multisets are now summarized. 



(i) inclusion 

(ii) equality 



AC B I'Aix) C vb{x), Vx G X. 

A = B va{x) = vb{x), Vx G X. 



(iii) union 



vavjb{x) = va{x) U jzb(x), 



Vx G X. 
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(iv) intersection 

t^Ansix) = va{x) n vb{x), Wx e X. 

(v) addition 

Define gA{z;x) and gB{z;x) by (30), and calculate gA<^B{z;x) using (31), 
whereby va®b{x) is obtained from (32). 

It is easily seen that the QTXii is a distributive lattice. This class is a metric 
space where the metric is defined by 

pOO 

d{A,B) = WfA- !b\\ = '^ / \lA{y]x) - fB{y,x)\dy. 

x&X "'0 



5 Conclusion 

We have reviewed fuzzy multisets and discussed features of infiniteness in fuzzy 
multisets. It should be noticed that an a-cut of such a fuzzy multiset with 
an infinite membership sequence will produce a finite crisp multiset. Further 
generalization using a closed set in a plane has been considered. 

We thus observe analytical features in fuzzy multisets in addition to algebraic 
properties from the discussion of metric spaces and analytical completion. 

Future studies include further development of the theory, e.g., t-conorms in 
fuzzy multisets and further generalization of the multisets. Moreover, the relation 
between fuzzy multisets and rough sets [12] should be investigated. 

Applications of fuzzy multisets include modeling of information retrieval sys- 
tems [11], which we will discuss elsewhere. 

References 

1. W.D. Blizard, Multiset theory, Notre Dame Journal of Formal logic, Vol. 30, No. 1, 
pp. 36-66, 1989. 

2. W.D. Blizard, Real-valued multisets and fuzzy sets. Fuzzy Sets and Systems, 
Vol. 33, pp. 77-97, 1989. 

3. B. Li, W. Peizhang, L. Xihui, Fuzzy bags with set-valued statistics, Comput. Math. 
Applic., Vol. 15, pp. 811-818, 1988. 

4. K.S. Kim and S. Miyamoto, Application of fuzzy multisets to fuzzy database sys- 
tems, Proc. of 1996 Asian Fuzzy Systems Symposium, Dec. 11-14, 1996, Renting, 
Taiwan, R.O.C. pp. 115-120, 1996. 

5. D.E. Knuth, The Art of Computer Programming, Vol.2 / Seminumerical Algo- 
rithms, Addison- Wesley, Reading, Massachusetts, 1969. 

6. Z.-Q. Liu, S. Miyamoto (Eds.), Soft Computing and Human-Centered Machines, 
Springer, Tokyo, 2000. 

7. Z. Manna and R. Waldinger, The Logical Basis for Computer Programming, Vol. 
1: Deductive Reasoning, Addison- Wesley, Reading, Massachusetts, 1985. 

8. S. Miyamoto, Fuzzy multisets with infinite collections of memberships, Proc. of 
the 7th International Fuzzy Systems Association World Congress (IFSA’97), June 
25-30, 1997, Prague, Chech, Vol.l, pp. 61-66, 1997. 




Fuzzy Multisets and Their Generalizations 



235 



9. S. Miyamoto, K.S. Kim, An image of fuzzy multisets by one variable function and 
its application, J. of Japan Society for Fuzzy Theory and Systems, Vol. 10, No. 1, 
pp. 157-167, 1998 (in Japanese). 

10. S. Miyamoto, K.S. Kim, Multiset-valued images of fuzzy sets, Proeeedings of the 
Third Asian Fuzzy Systems Symposium, June 18-21, 1998, Masan, Korea, pp.543- 
548. 

11. S. Miyamoto, Rough sets and multisets in a model of information retrieval, in 
F.Crestani and G.Pasi, eds.. Soft Computing in Information Retrieval: Techniques 
and Applications, Springer, pp. 373-393, 2000. 

12. Z. Pawlak, Rough Sets, Kluwer, Dordrecht, 1991. 

13. A. Ramer, G.-G. Wang, Fuzzy multisets, Proc. of 1996 Asian Fuzzy Systems Sym- 
posium, Dec. 11-14, 1996, Renting, Taiwan, pp. 429-434. 

14. A. Rebai, Canonical fuzzy bags and bag fuzzy measures as a basis for MADM with 
mixed non cardinal data, European J. of Operational Res., Vol. 78, pp. 34-48, 1994. 

15. A. Rebai, J.-M. Martel, A fuzzy bag approach to choosing the “best” multiat- 
tributed potential actions in a multiple judgement and non cardinal data context, 
Fuzzy Sets and Systems, Vol. 87, pp. 159-166, 1997. 

16. R.R. Yager, On the theory of bags, Int. J. General Systems, Vol. 13, pp. 23-37, 
1986. 




Universality Results 
for Some Variants of P Systems* 



Madhu Mutyam and Kamala Krithivasan 



Department of Computer Science and Engineering 
Indian Institute of Technology, Madras 
Chennai-36, Tamil Nadu, India 
madhuOmeena . iitm . ernet . in 
kamalaSiitm . ernet . in 



Abstract. P systems, introduced by Gh. Paun, form a new class of 
distributed computing models. Many variants of P systems were shown 
to be computationally universal. In this paper, we consider several classes 
of P systems with symbol-objects where we allow the catalysts to move in 
and out of a membrane. We prove universality results for these variants 
of P systems with a very small number of membranes. 



1 Introduction 

P systems are a class of distributed parallel computing devices of a biochemical 
type, introduced in [8], which can be seen as a general computing architecture 
where various types of objects can be processed by various operations. In the 
basic model one considers a membrane structure consisting of several cell-like 
membranes placed inside a main membrane, called the skin membrane. If a 
membrane does not contain any other membrane, then it is called an elemen- 
tary membrane. A membrane structure is represented by means of well-formed 
parenthesized expressions, strings of correctly matching parentheses, placed in 
a unique pair of matching parentheses which represents the skin membrane. 
Each pair of matching parentheses corresponds to a membrane. Graphically a 
membrane structure is represented by a Venn diagram without intersection and 
with a unique superset. The membranes delimit regions, where we place objects, 
elements of a finite set (an alphabet). The objects evolve according to given 
evolution rules, which are associated with regions. The rules are used in a non- 
deterministic maximally parallel manner: the objects to evolve and the rules 
to be applied to them are chosen in a nondeterministic manner, but no object 
which can evolve at a given step should remain idle. An object can evolve inde- 
pendently of the other objects in the same region of the membrane structure, or 
in cooperation with other objects. In particular, catalysts are considered. Cat- 
alysts are objects which evolve only together with other objects, but are not 
modified by the evolution (they just help other objects to evolve). The evolution 
rules are given in the form of multiset transition rules and in their right hand 
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side contain pairs {a, here), {a, out), (a, in), where a is an object. The meaning 
is that one occurrence of the symbol a is produced and it remains in the same 
region, is sent out of the respective membrane, or is sent nondeterministically 
into one of the inner membranes (which should be reachable from the region 
where the rule is applied), respectively. 

Another feature considered by Gh. Paun in [9] is the possibility to control 
the membrane thickness (by using the actions of r and <5). This is done as 
follows: Initially, all membranes are considered to be of thickness 1. If a rule in 
a membrane of thickness 1 introduces the symbol r, then the thickness of the 
membrane increases to 2. A membrane of thickness 2 does not become thicker 
by using further rules which introduce the symbol r, but no object can enter 
or exit it. If a rule which introduces the symbol S is used in a membrane of 
thickness 1, then the membrane is dissolved; if the membrane had thickness 2, 
then it returns to thickness 1. Whenever the skin membrane is dissolved, the 
whole membrane system will be destroyed. If at the same step one uses rules 
which introduce both 6 and r in the same membrane, then the membrane does 
not change its thickness. No object can be communicated through a membrane 
of thickness two, hence rules which introduce commands out, in, requesting such 
communications, cannot be used. However, the communication has priority over 
changing the thickness: if at the same step an object should be communicated 
and a rule introduces the action r, then the object is communicated and after 
that the membrane changes the thickness. 

Starting from an initial configuration (identified by the membrane structure, 
the objects - with multiplicities - and rules placed in its regions) and using 
the evolution rules, we get a computation. We consider a computation complete 
when it halts, no further rule can be applied. Two ways of assigning a result to 
a computation were considered: one by designating an elementary membrane as 
the output membrane, and the other by reading the result outside the system. 
We deal here with the latter variant. 

In [2], with one catalyst and without moving the catalyst across membranes, 
universality of P systems was achieved with four membranes. If we relax this 
condition by allowing movement of catalysts across membranes, with three mov- 
ing catalysts we can achieve universality with only 3 membranes, and if there 
is no bound on the number of catalysts, then we can achieve universality with 
only 2 membranes. With three moving catalysts, we can achieve universality of 
uniform P systems (open problem suggested by Gh. Paun in [11]) with 3 mem- 
branes. Similarly, with three moving catalysts, we can achieve universality of P 
systems with communicating rules (open problem suggested by Gh. Paun in [11]) 
with 4 membranes. Finally, in [6], with one catalyst and without moving the cat- 
alyst across membranes, universality of P systems with membrane creation was 
achieved with the degree (1,4). If we use moving catalysts, then we can achieve 
universality of P systems with membrane creation with the degree (1,3). 

In the next section we give some prerequisites. In Section 3 we define P sys- 
tems with moving catalysts. Section 4 discusses uniform P systems with symbol- 
objects, Section 5 considers P systems with communicating rules. Section 6 dis- 
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cusses P systems with membrane creation and the paper concludes with a brief 
note in Section 7. 

2 Prerequisites 

For an alphabet V, V* is the free monoid generated by V, A is the empty string, 
and = V* — {A}. The length of x G V* is denoted by |x|, while \x\a is the 
number of occurrences of the symbol a in the string x. If V = {oi, • • • , a„} (the 
ordering is important), then <Fy(a;) = (|x|ai, • • • , |a;|a„) is the Parikh vector of 
the string x G V*. For a language L G V'*,<Fy(L) = {lFy(a;) \ x G L} is the 
Parikh set of L. 

A matrix grammar with appearance checking is a construct G = {N, T, S, 
M, F), where N, T are disjoint alphabets, S G N, M is a finite set of sequences 
of the form {Ax xi, • • • , A„ ^ Xn), n > 1, of context-free rules over N UT 
(with Ai G N,Xi G {N U T)*, in all cases), and F is a set of occurrences of rules 
in M {N is the nonterminal alphabet, T is the terminal alphabet, S is the axiom, 
while the elements of M are called matrices.) 

For w, z G {Nut)* we write w ^ z if there is a matrix {A\ ^ xi, • • • , A„ ^ 
Xn) in M and the strings Wi G {N U T)* , 1 < i < n + 1, such that w = wi,z = 
w„+i, and, for all 1 < i < n, either (l)r(;i = w{Aiw'/ , Wi+i = w{xiw" , for some 
w[,w" G {N U r)*, or {2)wi = Wi+\,Ai does not appear in ruj, and some rule 
Ai Xi appears in F. (The rules of a matrix are applied in order, possibly 
skipping the rules in F if they cannot be applied, so we say that these rules are 
applied in the appearance checking mode.) 

The language generated by G is defined by L{G) = {w G T* | S' w}. 
The family of languages of this form is denoted by MATac- It is known that 
MATac = RE. 

A matrix grammar G = {N, T, S, M, F) is said to be in the binary normal 
form if Af = iVi U A ^2 U {S, f}, with these three sets mutually disjoint, and the 
matrices in M are in one of the following forms: 



1. 




^ XA), with X gNx, A G N 2 ] 


2. 


(A- 


-^Y,A- 


x), with X,Y G Ni,A G N 2 , x G {N 2 U T)* 


3. 


(A- 


-^Y,A- 


t), with A, F e Ni,A G N 2 ] 


4. 


(A- 


A,A- 


x), with A G Ni, A G N 2 ,x G T*; 



Moreover, there is only one matrix of type 1 and F consists exactly of all rules 
A ^ \ appearing in matrices of type 3; f is called a trap symbol, because once 
introduced, it is never removed. A matrix of type 4 is used only once, in the last 
step of a derivation. 

According to [1], for each matrix grammar there is an equivalent matrix 
grammar in the binary normal form. 

For an arbitrary matrix grammar G = {N,T, S, M, F), let us denote by 
ac{G), the cardinality of the set {A G N \ A ^ a G F}. From the construction 
in the proof of Lemma 1.3.7 in [1] one can see that if we start from a matrix 
grammar G and we get the grammar G' in the binary normal form, then ac{G') = 
ac{G). 
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Improving the result from [7] (six nonterminals, all of them used in the ap- 
pearance checking mode, suffice in order to characterize RE with matrix gram- 
mars), in [3] it was proved that four nonterminals are sufficient in order to 
characterize RE with matrix grammars and out of them only three are used 
in appearance checking rules. Of interest here is another result from [3]; if the 
total number of nonterminals is not restricted, then each recursively enumerable 
language can be generated by a matrix grammar G such that ac(G) < 2. 

Consequently, to the properties of a grammar G in the binary normal form 
we can add the fact that ac(G) < 2. This is called the strong binary normal form 
for matrix grammars. 

A P system (of degree m > 1) is a construct 

E[ — (V, T j (7, /r, rci , * * * , Wm , Ri , * * * , Rmf} ? 



where: 

1. V is an alphabet; its elements are called objects; 

2. TCP, is the output alphabet; 

3. G CV,G DT = (j), is the set of catalysts; 

4. /i is a membrane structure consisting of m membranes, with the membranes 
and the regions labeled in a one-to-one manner with elements in a given set; 
in this section we use the labels 1,- • • ,rn; 

5. Wi, 1 < i < m, are strings representing multisets over V associated with the 
regions 1,- • • , m of yi; 

6. Ri, 1 < i < m, are finite set of evolution rules over V associated with the 
regions 1, • • • , m of /i; 

An evolution rule is of the form u —>■ v, where u is a string over V and v = v' 
or V = v'S or V = v't, where v' is a string over {(a, here), (a, out), (a, in) \ a G 
(V — C)}, and S and t are special symbols not in V. The length of u is called 
the radius of the rule u ^ v. 

When presenting the evolution rules, the indication here is in general omitted. 

If n contains rules of radius greater than one, then we say that 7T is a system 
with cooperation. Otherwise, it is a non-cooperative system. A particular class of 
cooperating systems is that of catalytic systems: the only rules of radius greater 
than one are of the form ca cv, where c G C,a G {V — G), and v contains no 
catalyst; moreover, no other evolution rules contain catalysts (there is no rule of 
the form c ^ v or a ^ V\CV 2 , for c G G). 

The membrane structure and the multisets in U constitute the initial config- 
uration of the system. We can pass from a configuration to another one by using 
the evolution rules. This is done in parallel: all objects, from all membranes, 
which can be the subject of local evolution rules, should evolve simultaneously. 
A rule can be used only if there are objects which are free at the moment when 
we check its applicability. 

The application of a rule ca ^ cv in a region containing a multiset w means 
to remove a copy of the object a in the presence of a copy of c, providing that 
such copies exist, then follow the prescriptions given by v: If an object appears 
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in V in the form {b,here), then it remains in the same region; if it appears in 
the form (b,out), then a copy of the object b will be introduced in the region 
of the membrane placed outside the region of the rule ca —>■ cv; if it appears in 
the form (6, in), then a copy of b is introduced into one of the inner membranes, 
nondeterministically chosen. 

A sequence of transitions between configurations of a given P system U 
is called a computation with respect to U. A computation is successful if and 
only if it halts, that is, there is no rule applicable to the objects present in the 
last configuration. The result of a successful computation consists of the vector 
describing the multiplicity of all objects from T which are sent out of the system 
during the computation. The set of all vectors of natural numbers produced in 
this way by a system 77 is denoted by 7V(77). The family of all such sets N{II) 
of vectors of natural numbers generated by a P system with catalysts and the 
actions of both S,t, and of degree at most m,m > 1, using target indications 
of the form here, out, in, is denoted by NPm{Cat,i/o, S,t); when one of the 
features a G {Cat, S, r} is not present, we replace it with na. When the number 
of membranes is not bounded, we replace the subscript m with *. 

By PsRE, we denote the family of recursively enumerable sets of vectors of 
natural numbers, i.e., the sets which are Parikh images of recursively enumerable 
languages. If we can compute PsRE with a class of P systems, we say that 
universality can be achieved in that class of P systems. 

3 P Systems with Moving Catalysts 

So far in all the variants of P systems with symbol-objects, the evolution rules 
with a catalyst were of the form ca cv, where c G C, v = v' or v = v'S or 
V = v't, where v' is a string over {(a, here), (a, out), (a, in) \ a G (V — C)}. So, 
a catalyst will never go out of the region where it was initially placed. 

Here we remove this restriction and allow a catalyst to move from one 
membrane to another one. At the same time we stick to the point that a 
catalyst evolves only together with other objects, but is not modified by the 
evolution (they just help other objects to evolve). In this new framework, an 
evolution rule with a catalyst will be of the form ca —>■ (c, tar)v, where c G 
C, tar G {here, out, in}, v = v' or v = v'S or v = v't, where v' is a string over 
{(a, here), (a, out), (a, in) \ a G {V — C)}. 

The family of sets of vectors of natural numbers N{1J) generated by P sys- 
tems with moving catalysts, and the actions 6,t, and of degree at most m > 1, 
using target indications of the form here, out, in, using at most n catalysts, is 
denoted by NPm{MCatn,i/o,5,T). In [2], authors proved 

PsRE = NP 4 {Cat, i/o, 6, t). 

Here we improve this result by achieving universality with only 3 membranes 
with 3 moving catalysts. If there is no bound on the number of catalysts, then 
we can achieve universality with only 2 membranes. 
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Theorem 1. PsRE = NP^{MCat^,i/o^5,T). 

Proof. We prove only the inclusion PsRE C NP^{MCat^,i/o,5,T), the reverse 
inclusion can be proved in a straightforward manner. Let us consider a matrix 
grammar with appearance checking, G = (N,T, S, M, E), in the strong binary 
normal form, that is with N = NiU N 2 U {S',!}, with ac{G) < 2, and with the 
rules of the four forms mentioned in Section 2. 

Assume that we are in the worst case, with ac{G) = 2, and let and 
be the two symbols in N 2 for which we have rules B^^'> ^ f in matrices of 
M. Let us assume that we have h matrices of the form m' : {X 
]),X,Y G NiG G {1,2},1 < i < h, and k matrices of the form rm : {X ^ 
a,A^ x), X G A^i, A G N 2 , a G iVi U {A}, and x G {N 2 U T)*, 1 < i < k. The 
matrices of the form {X Y,B^^'> — > f),A, T G fVi are labeled by to', with 
i G labj, for j G {1, 2}, such that labi, lab 2 and labo = {1, 2, • • • , fc} are mutually 
disjoint sets. 

We construct a P system with symbol-objects of degree 3 as 
n = (P,T, C,/X,Wi,W2,W3,i?i,i?2,i?3), 
with the following components: 



y = iVi U iV 2 U A, I A G fVi, A G A^ 2 , 1 < * < 3fc} 

U {A'„ A'3, I A G w, 1 < z < fc} U {Ai, Af I A G Ai, 1 < i < /i} 
U {Z, E, Ei,-\} U {cq. Cl, C 2 }; 

C = {co,ci,C2}; 

M = [l[2[3]3]2]i; 



Wi = (j)] 

W2 = {co,ci,C2, A, A}; 
W 3 = {E}] 



R\ contains the following rules: 



1 . 


Xsi- 


-2 


^ Asi-iZr, 1 < i 


< 


2. 


Asi- 


2 


A^i-xb, 1 < z 


< 


k; 


3. 


Xsi- 


-1 


^ XsiZr, 1 < z 


< 


k; 


4. 


Asi- 


1 


A^ib, 1 < z < 


k; 




5. 


Xzi 




A 3 , At, 1 < z < 


k 


— 


6. 


Asi - 




A'^ib, l<i<k 


- 


1 ; 


7. 


^ 3 . 




A 3 Z +1 At, 1 < z 


< 


k 


8. 






Asi+i^, 1 < z < 


k 


- 


9. 




(A, out); 






10 . 




t 









11. coA 3 i ^ {cQY,in), for nii : (A ^ A, A ^ x), 1 < z < A:; 

12. A 3 i ^ A, for TOi : (A ^ A, A ^ x), 1 < z < fc; 

13. A^i (x, in), for rrzi : (A ^ a, A ^ x), 1 < z < fc, a G Ai U {A}; 

14. A3fc ^ t; 
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15. Agfc ^ t; 

16. a ^ {a,out),a G T; 

17. t ^ t; 

i ?2 contains the following rules: 

1. X ^ {Xi, out)r; 

2. cqA {cqAi, out)S; 

3. c,X ^ [c,Xi,in), for m' : (X ^ ^ t),J e {1,2}; 

4. A ^ {A,in),AG N 2 ] 

5. a ^ {a,out),a G T; 

i ?3 contains the following rules: 

1. if ^ Eit; 

2. El -> ES] 

3- CjX^^ (cjY,out), for m' : {X Y,BA) f),j e {1,2}; 

4. A ^ {A, out); 

5. c,i?W^c,t,jG{l,2}; 

6. t ^ t; 

The system works as follows. 

Initially, membrane 2 contains the objects co,ci,C 2 ,vV, and A. If we want to 
simulate a matrix rrii of type 2, then the object X will be replaced with Xi and 
sent out when also increasing the thickness of membrane 2. At the same time, 
the object A with the help of the catalyst cq will be replaced with Ai and sent 
out when also decreasing the thickness of membrane 2. Since we apply these 
two rules simultaneously, the thickness of membrane 2 will not be changed. On 
the other hand, if we apply only X —>■ (Xi,out)r, then a trap symbol will be 
introduced in the skin membrane by using the rule Z —>■ ) (since we can not 
apply the rule Z (X,out)). Similarly, if we apply only cqA (coAi,out)S, 
then the whole system will be destroyed by using a rule A\ —>■ A 2 S in the skin 
membrane. So, after the simultaneous application of the rules X (Xi,out)r 
and CqA (cqAi, out)S, the skin membrane gets Xi and cqAi. The subscripts 
of X and A are incremented step by step. At any time we can apply the rules 
coXsi (coY,in) and A^i (x,m). If we apply these two rules in different 
steps, then either a trap symbol will be introduced or the whole system will be 
destroyed. Thus, we can simulate a matrix rrii of type 2. 

If we simulate a matrix of type 4, then the procedure is the same as 
above, excepting the last step, where we apply the rule X^i — > A instead of 
coXsi (cqY, in). After simulating a matrix m-i of type 4, if there is any symbol 
A G N 2 present in membrane 2, then it will keep on evolving by using the rules 
A —>■ (A, in) in membrane 2 and A — > (A, out) in membrane 3, so that the 
computation never halts. 

If we simulate a matrix m' of type 3, then we use the rules cjX (cjX)!,m), 
for m) : {X ^ Y, f), j G {1,2}, and A {A, in), A G N 2 - If we use the 

rule CqA {coAi,out)S simultaneously with the above set of rules, then the 
whole system will be destroyed by using the rule Ai A2i5. Once the objects 
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Cj,Xf and A,AgN 2 enter membrane 3, its thickness will be increased to 2 (due 
to the application of the rule E Eit). As the thickness of the membrane 3 
is two, we cannot apply the rules CjXf (cjY,out), for m' : (A ^ 
t), j G {1,2} and A {A, out). At this step, if there is a nonterminal in 
membrane 3, then we can apply the rule CjB^^^ Cj],j G {1,2}, so that the 
computation never halts. Otherwise, after reducing the thickness to 1 (by using 
the rule Ei ES), we can apply ^ (cjY,out), for to' : (A ^ Y,B^^^ 
t), j G {1, 2} and A (A, out). In this way, we can simulate a matrix to' of type 

3. Thus, the equality Et{L{G)) = A(77) follows. □ 

Theorem 2. PsRE = NP 2 {MCat^.,i/o,S,T). 

Proof. We proceed as in the previous proof, by considering a matrix grammar in 
the strong binary normal form, G = (A, T, S, M, P). Let us assume that we have 
h matrices of the form to' : (A ^ Y, B^^'^ t), Y G Ai, j G {1, 2}, 1 < t < h, 

and k matrices of the form rrii : (X ^ Y, A ^ x), X € Ni,A G A2, Y G AiU{A}, 
and X G (A2 U T)* , 1 < i < k. Each matrix of the form (A ^ A, A ^ x), A G 
Ai, A G A2, a; G T*, is replaced by (A ^ /, A ^ x), where / is a new symbol. 
We continue to label the obtained matrix in the same way as the original one. 
Here we consider the catalyst set C = {ci, • • • , Cfe, di, ^2} where each Ci is for 
TOi : (A ^ y, A ^ a;), 1 < i < A: and each dj is for niY. {X ^ Y, B^^^ ^ t), i G 
{1, 2}. We construct a P system with symbol-objects of degree 2 as 

n = (P,r, C,^,wi,r(;2, Ai,i?2), 

with the following components: 

P = AiUA2U{A',A^Ai I AgAi}U {/,/!,/',}} U{ci,C2,---,Cfc,di,d2}; 

C = {ci, C2, • • • , Cfc, ^1,^2}; 

M = [l[2]2]i; 

Wi = {ci,C2,- • • ,Cfc,di,d2, A}, for (S' ^ AA) of G; 

W 2 = {A}, for (S ^ AA) of G; 

R\ contains the following rules: 

1. CiX {ciY' , in), for TOi : (A ^ y, A ^ a;), y G Ai U {/}, of type 2 and 4; 

2. djX (djY^,in), for to' : (A ^ Y,B^^^ f),j G {1,2}, of type 3; 

3. / - A; 

4. a ^ (a, out) , a G T; 

5. Ai ^t,AG Ai; 

6 . A — > f , A G A 2 ; 

7. t-t; 

i?2 contains the following rules: 

1. y'^yiT,y G Aiu{/}; 

2. a A ax6, for TOi : (A ^ y, A ^ x), y G Ai U {/}, of type 2 and 4; 
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3. 


gYi 


^ (cA,out),Y G A^i; 


4. 


Yi - 


^Yi,YeNi; 


5. 


fi- 


^fS; 


6. 


/- 


/; 


7. 


Y^ - 




8. 


djB^Y for nii : {X ^ Y,B^d) g {1,2}, of type 3; 


9. 


djY 


^ {djY,out),j G {1,2}; 


10. 


a 


(a, out), a G T; 


11. 




t; 



The system works as follows. 

The skin membrane contains Ci, • • • , c^, c?i, ^ 2 , and X and membrane 2 con- 
tains A. (The symbols X and A are for the matrix (S' ^ XA)) 

To simulate a matrix rrii of type 2 or 4, with the help of a catalyst c^, X will be 
replaced with Y' ,Y € Ni U {/}, and move into the inner membrane 2 along with 
Ci- In membrane 2, Y' will be replaced with Yi and the thickness of the membrane 
will be incremented. At the same time if any symbol A G N 2 is present (for a 
matrix nii : {X Y, A ^ x)), then with the help of the catalyst Ci, it will be 
replaced with x and the thickness will be decremented. Otherwise, the thickness 
is 2 so that we cannot apply the rule CiY\ (ciY, out) and the computation 
never halts due to the application of the rule Yi ^ Yi,Y G A^i U {/}. On the 
other hand, if there are multiple copies of A G A ^2 for rrii : {X Y,A x), 
and instead of applying the rule cAi (ciY,out), we apply again the same 
rule CiA CixS, then a trap symbol will be introduced in the skin membrane 
by using the rule Yi ^ f,V G A^i. If we simulate a matrix of type 4, then the 
symbol / will be erased in the skin membrane. After simulating a matrix of type 
4, if there is any symbol A G N 2 is present in the skin membrane, then a trap 
symbol will be introduced using the rule A 

To simulate a matrix m' of type 3, with the help of a catalyst dj,j G {1, 2}, 
X will be replaced with Y^ and move into the inner membrane 2 together with 
dj. In membrane 2, Y^ will be replaced with Y. At the same time, if any symbol 
G N 2 (for nii : {X ^ ^ g {1,2}) is present in the membrane 

2, then a trap symbol will be introduced by using the rule djE^) dj] so that 
the computation never halts. Otherwise, with the help of the same catalyst dj, 
Y will be sent out (using the rule djY —>■ (djY,out)) from membrane 2. In this 
way, we can simulate a matrix of type 3. Thus, the equality 'Ft{L{G)) = N{U) 
follows. □ 

4 Uniform P Systems with Symbol-Objects 

In [5], the author considered a variant of P systems: simple or uniform P systems 
based on rewriting and splicing. Here we consider another variant, i.e., uniform 
P systems with symbol-objects. The idea of having such a system and to study 
its properties was suggested as an open problem in [11]. Uniform P systems are 
the systems for which we have a single set of rules for all the membranes. 




246 Madhu Mutyam and Kamala Krithivasan 



Formally, a uniform P system with symbol- objects of degree m,m > 1 is a 
construct 

n = ,Wjn,R), 

with the usual notation as given in Section 2, and where i? is a finite set of 
evolution rules over V. 

As usual, an evolution rule is of the form u —>■ v, where u is a string over V and 
V = v' or V = v'S or V = v't, where v' is a string over {(a, here), (a, out), (a, in) \ 
a € {V — C)}, and S and r are special symbols not in V. The computations in 
n are defined as in a system having R as present in all its regions. The family 
of sets of vectors of natural numbers N{II) generated by uniform P systems 
with symbol objects with at most n moving catalysts, and the actions 5, r, and 
of degree at most m > 1, using target indications of the form here, out, in, is 
denoted by NUPm{MCatn,i/o,S, t). 

Theorem 3. PsRE = NU P^{MCatz,i/ o,5,t). 

Proof. We proceed as in the previous proofs: starting from a matrix grammar 
in the strong binary normal form, G = (N, T, S, M, F), we construct a uniform 
P system with symbol-objects of degree 3 as 

n = (V,T,C,fi,wi,W2,W3,R), 

with the following components : 



V = N 1 UN 2 U {X„ Ai,A' \ X € Ni,Ag N2,l<i< 3k} 

u {X'^i \ x eNi,i<i<k}u{xl,x^\x eNi,i<i<h} 

U {Z, E, El,]} U {co. Cl, C2}; 

C = {co,ci,C2}; 

h = [l[2[3]3]2]i; 

Wi = (j; 

W2 = {co,ci,C2, A, A}; 

W3 = {E}-, 



R contains the following rules: 



1. 


A - 


> {Xi,out)r; 






2. 


Co A 


(cqAi, ouf)S; 






3. 


Asi- 


-2 ^ X^i-iZr ,1 < i 


< k; 


4. 


A3i- 


-2 ^ A^i-i5, 1 < z 


< 


k; 


5. 


Asi- 


-1 ^ XsiZr, 1 < i 


< 


k; 


6. 


A3i- 


-1 ^ A^iS, 1 < z < 


k] 




7. 


Asi 


^ 1 < z < 


k 


-1; 


8. 


^3i ■ 


AgjJ, 1 < Z < fc 


- 


1; 


9. 


^3. 


Agi+iZr, 1 < z 


< 


k-1 


10. 




Aoi+iJ, 1 < z < 


k 


-1; 
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11. 




(A, out); 








12. 




' t; 








13. 


coXzi (coY,in), for 


: 




^ Y, A ^ x),l < i < k; 


14. 


Xsi 


A, for TOi : {X - 


-A, 


A- 


^ x),l < i < k; 


15. 


A^i - 


(x, in), for mi : 




^ a 


, A ^ x),l < i < k,a G N\\J {A}; 


16. 


Xsk 


t; 








17. 


Ask 


^ t; 








18. 


c,X 


^ {cjXf,in), for 


m' : 


{X 




19. 




■ {A',in),AG Ny, 








20. 


E ^ 


■ Eit; 








21. 


El - 


ES; 








22. 


C 3 X 1 


] (cjY,out), for 


m'i 


:{X 


^Y,BA) ^]),jG { 1 , 2 }; 


23. 


A' - 


■> {A, out), A G N 2 








24. 


{1,2}; 






25. 


a ^ 


(a, out), a G T; 








26. 




t; 









The proof of the equality = N{II) is similar to Theorem 1. The main 

difference is in rules 19 and 23, where a symbol object A G V becomes A! while 
it enters membrane 3, and becomes A while comes back to membrane 2. □ 



5 P Systems with Communicating Rules 

We consider now another open problem from [11], about P systems with no 
object evolving rules, i.e., no rule of the form a be. So, such a system consists 
of only rules used for object communication from one region to another region. 
Because no object can be produced inside the system, we have to consider rules 
for bringing objects from a specified source, for example, from the outer region. 
This can be done by considering a new command associated with objects, besides, 
here, in, out, for instance, come; rules introducing symbols (b, come) will be used 
in the skin membrane only, with the obvious meaning, of bringing a copy of b 
from outside of the system. Note that even simple rules of the form a b are not 
allowed, here we have either to allow changing of objects when passing through 
a membrane, or to simulate them, for instance, by sending the symbol a outside 
the system and, at the same time, bringing b from the outer region. 

Formally a P system with communicating rules of degree m> 1, is a construct 

Ph — (F, P , C , * * * , Wm. , Pi 5 ‘ ‘ ‘ 5 Pro) j 

with the usual notation as given in Section 2 and Pi sets of rules over V associated 
with the regions of /i. 

The rules are of the following types: 

1. If a is an object, then a rule of the form a (y, come), where u is a string 
of objects, can be used in the skin membrane only; 
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2. If a is an object and c is a catalyst, then a rule of the form ca (c,tar) 
(v, come), tar G {here, in}, where is a string of objects, can be used in the 
skin membrane only; 

3. If a and b are single objects, then a rule of the form a — > {b, tar), tar € 
{out, in}, can be used in any membrane; 

4. If a, b are single objects and c is a catalyst, then a rule of the form ca 
{c,tari){b,tar 2 ),tari G {here, in, out} and tar 2 G {out, in}, can be used in 
any membrane; 

5. If a is an object, then a rule of the form a {a, tar), tar G {here, out, in}, 
can be used in any membrane; 

6. If a is an object and c is a catalyst, then a rule of the form ca (c, tar){a, tar), 
tar G {here, out, in}, can be used in any membrane; 

The application of a rule ca —>■ {c, tar) {v, come) in the skin membrane con- 
taining a multiset w means to remove a copy of the object a in the presence 
of the catalyst c, provided that such copies exist, and adding the objects spec- 
ified V to it. At the same time, the catalyst c will follow the prescription given 
by tar: if tar is here, then the catalyst c remains in the same region; if tar 
is in, then the catalyst c will be introduced in one of the inner membranes. 
The main idea behind the application of this rule is to send one copy of a, in 
presence of the catalyst c, out of the skin membrane and at the same time to 
bring the objects specified by v into the skin membrane. The application of a 
rule ca {c, tar) {a, tar) in a region containing a multiset w means to remove 
a copy of the object a in presence of the catalyst c, provided that such copies 
exist, then to follow the prescriptions given by tar. The application of a rule 
ca — > c{b,tar),tar G {out, in}, is similar. Similarly, a single object can move 
either into a membrane or out of the membrane by transforming into another 
object by rules of the form a {b, tar), tar G {out, in}. The use of r and 6 for 
varying thickness of the membrane are as discussed in Section 2. Computations 
are defined in the usual manner. The family of sets of vectors of natural num- 
bers N{n) generated by a P system with communicating rules with at most n 
moving catalysts, the actions 6, t, and of the degree at most m > 1, using target 
indications of the form here, out, in, is denoted by NPCm{MCatn,i/o,6,T). 

Theorem 4. PsRE = N PCi{MCat^,i/ o,t,S) 

Proof. We proceed as in the previous proofs: starting from a matrix grammar 
in the strong binary normal form, G = (N, T, S, M, F), we construct a P system 
with communicating rules of degree 4 as 

n = {V,T,C, fj,,wi,W2,W3,Wi, Ri, R 2 , Rs, Ri), 



where 



P = W U A^2 U {X',Xi,A, I XGfVi,AGfV2,l<i<2fc} U {Xl,X,^ | XeNi,l<i<h} 
U A" , A',, A" I A G iVi, A G A 2 , 1 < i < fc - 1} 

U {Z, Ei,E2,}} U {cq, Cl, C2}; 
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C = {co,Ci,C2}; 

= [l[2[3[4]4]3]2]i; 

W2 = {co,ci,C2,X,A}; 

wi = 1V4 = 4>] 

W3 = {El}; 

R\ contains the following rules: 



1 . ^2*- 


.1 ^ (X 2 iZ, come)r, 1 < i < k: 


) 


2. 4l2i_ 


1 (A 2 i, come)S, 1 < z < fc; 




3. X 2 . - 


{X' 2 ^Z, come)T, 1 < i < k — 


1; 


4. A2i- 


{A' 21 , come)S, 1 < i < k — 1; 




5. ■ 


{Xij^Z, come)T, 1 < i < k — 


1; 


6- ^2i ■ 


{A' 21 , come)S, 1 < i < k — 1; 




7. X!^,- 


{X 2 i+iZ, come)T, 1 < i < k 


-1; 


8. - 


(Gl 2 i+i, come)S, 1 < i < k — 


1; 


9. X2i - 


{Y' Z,come)T, for rrii : {X - 


-^r,2 


10. X 2 . ■ 


{ZX, come)r, for rrii : {X 


• A,2l 


11. ^2. - 


{x, come)S, 1 < i < k; 




12. coF' 


{coY,in); 




13. 


(A, out); 




14. 


(t,zn); 




15. A^ 


{A, in),A& N 2 ; 




16. X2k 


(t,zn); 




17. A2k 


(t.fo); 




18. 


(a, out), a G T; 




i ?2 contains the following rules: 




1. X ^ 


> {Xi,out)r; 





x), 1 < i < fc; 

x), 1 < z < fc; 



2. cqA (cqAi, out)S; 

3. ^ [cjXlin), for m[ : (X ^ ^ \),j G {1,2}; 

4. A — > {A^ in } , A G X 2 ; 

5- t ^ t; 

i ?3 contains the following rules: 

1. El {E 2 , in)T; 

2. E 2 — > {Ei,in)S; 

3. CjX^ (cjY,out), for m{: {X ^ ^ 1 ^ z ^ G {1,2}; 

4. Cj{\,in),j G {1,2}; 

5. A ^ {A,out), A & N 2 ; 



i ?4 contains the following rules: 

1. i?i ^ {El, out); 

2. E 2 {E 2 , out); 

3. t-t; 
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The system works as follows. 

Initially, membrane 2 contains the objects Cq, Ci, C 2 , X, and A. If we want to 
simulate a matrix m-i of type 2, then the object X will be replaced with Xi and 
sent out while increasing the thickness of membrane 2. At the same time, the 
object A with the help of the catalyst cq will be replaced with Ai and sent out 
while decreasing the thickness of membrane 2. Since we apply these two rules 
simultaneously, the thickness of membrane 2 will not be changed. On the other 
hand, if we apply only X (Xi, out)r, then a trap symbol will be introduced 
by using the rule Z (tO^) (since we can not apply the rule Z (A, out)). 
Similarly, if we apply only cqA —> (cqAi, out)S, then the whole system will be 
destroyed by using a rule Ai —>■ (A 2 , come)S in the skin membrane. So, after the 
simultaneous application of the rules X {Xi,out)r and cqA {coAi,out)S, 
the skin membrane gets Xi and cqAi. The subscripts of X and A are incremented 
step by step. At any time we can apply the rules X 2 i (Y' Z,come)T and 
A 2 i (x, come)S. If we apply these two rules in different steps, then either a 
trap symbol will be introduced or the whole system will be destroyed. In the 
next step, Y' will be replaced with Y and sent into membrane 2 and all A G N 2 
will also be sent to membrane 2. Thus, we can simulate a matrix rrii of type 2. 

If we simulate a matrix rm of type 4, then the procedure is the same as above, 
except in the last step, where we apply the rule X 2 i {Z\, come)T instead of 
X 2 i {Y'Z,come)T. After simulating a matrix rrii of type 4, if there is any 
symbol A G N 2 is present in membrane 2, then it will keep on evolving by using 
the rules A —> {A, in) in membrane 2 and A —>■ (A, out) in membrane 3, so that 
the computation never halts. 

If we simulate a matrix m' of type 3, then we use the rules cjX {cjXl ,m), 
for m' : (A ^ f),j e {1,2} and A (A, in), A G A 2 . If we use the 

rule CqA (cqAi, out)S simultaneously with above set of rules, then the whole 
system will be destroyed by using the rule Ai {A 2 , come)6. Once the objects 
Cj,Xl and A{g N 2 ) enters membrane 3, its thickness will be increased to 2 (due 
to simultaneous application of the rule E\ (E2,in)T). As the thickness of 
membrane 3 is two, we can not apply the rules Aj' ^ {cjY, out), for m' : (A ^ 
y, f), j G (1, 2} and A {A, out). At this step, if there is a nonterminal 

B^^'^ in the membrane 3 then we can apply the rule CjB^^'^ 0(t, J G {1, 2}, 

so that the computation never halts. Otherwise, after reducing the thickness to 
1 (by using the rule E 2 (Ei,in)S) we can apply c^A^ ^ (cjY,out), for 
m' : (A ^ Y,B^^^ f),j G {1,2} and A {A, out). In this way we can 
simulate a matrix m' of type 3. Thus, the equality Et{L{G)) = A(7T) follows. 

□ 



6 P Systems with Membrane Creation 

P systems with membrane creation [4] constitute a variant of P systems where 
each membrane contains both productive and non-productive objects. A produc- 
tive object is an object which can create a new membrane and transforms into 
another object. A non-productive object only transforms into another object 
without creating a new membrane. 
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Formally a P system with membrane creation of degree (m,n), n > m > 1, 
is a construct 

II — (U, IF, (F, /r, , * * * , Win 1 1^1 1 ‘ ‘ ‘ 1 Pn) 1 

with the usual notation as given in Section 2 and Ri, 1 < i < n, are finite set of 
evolution rules over V. The evolution rules are of two types: 

1. If a is a non-productive object, then the evolution rule is in the form a v 
or ca (c, tar)v, where c G C, tar G {in, out, here}, a G {V — C), v = v' 
or V = v'6 or V = v't, where v' is a multiset of objects over {{V — C) x 
{here, in, out}) and 5, r are special symbols not in V. 

2. If o is a productive object, then the evolution rule is in the form a — > [i v]i 
or ca (c,tar)[i v]i, where c G C, tar G {in, out, here}, a G {V — C), v 
is a multiset of objects over (V — C). A rule of the form a ^ [i v]i means 
that the object identified by a is transformed into the objects identified by 
V, surrounded by a new membrane having the label i. 

The family of all sets of vectors of natural numbers N{II) generated by P 
systems with membrane creation with at most r moving catalysts and the actions 
of both S, T, and of degree (jn, n),n> m> 1 is denoted by 

NPMC(^„i,n){MCatr, i/o, 6, r). 

In [6], it is proved that 

PsRE = NPMC(^i^ 4 ){Cat,tar, r, d). 

Here we improve this result by achieving PsRE with the degree (1,3), but 
making use of moving catalysts. 

Theorems. PsRE = NPMC(^i_^){MCat^,i/o,nT,S). 

Proof. We proceed as in the previous proofs: starting from a matrix grammar 
in the strong binary normal form, G = (N, T, S, M, E), we construct a P system 
with membrane creation as 

n = (y, T, c, /i, wi, R\, i?2, As), 

with the following components : 

y = fVi U fVa u {X', Xi\X G Ni,l<i<k} 

\J{X],X^ \ X GNi,l<i<h} 

U {Ai I A G iVa, 1 < i < fc} 

U {E,Ei,f, Z,]} U {co,ci,C2} 

C = {co,ci,C2}; 

h = [i]i; 

= {cq. Cl, C2, A, A}, for {S ^ X A) being the initial matrix of G; 
and the sets Ri, R 2 and A3 contains the following rules: 
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- i?i : 

1. X ^ [2X']2; 

2. X ^ [sEXi]^, for m' : (X ^ Y, ^ f); 

3. coA ^ {coAuin),A& N 2 - {B^^\bA)}; 

4. CjB(A B(A, in), j G {1,2}; 

5. ^ Y, for m' : (X ^ Y, B^A ^ j); 

6. Ai ^ -\,1 < i < k; 

7. / ^ [3A]3; 

8. Z^A; 

9.0^ (a, out); 

10. t-t; 

~ R2 : 

1. X'^Xi; 

2. Xi^X,+i,l<i<(fc-l); 

3. A, A,+i,l< i < (fc- 1); 

4. Xi ^ YS, for rm: {X ^Y,A^ x),X G Ni,Y G NiU {/}; 

5. Ai xZ, for m,:{X ^Y,A^ x),X G Ni,Y G Ni\J {/}; 

6 . 

7. Xfc ^ t; 

8. Ak t) 

9. t-t; 

- i?3 : 

1. E ^ Ei; 

2- CjXf Cj\,j G {1,2}; 

3. El XS; 

4. Ai^ A,AgN2-{BA),b'^A}- 

5. A^A,AgN2] 

The system works as follows. 

Initially, the skin membrane contains the objects X, A and the catalysts 
co,ci,C 2 . Here X is a productive object and creates a membrane. In order to 
simulate a matrix of type 2 or 4, the object X will create a membrane with label 
2 and transforms into X'. Whenever a membrane with index 2 appears, with the 
help of catalyst cq, a non-productive object A G N 2 will move into membrane 2 
as Ai. At the same time, X' will transform into Xi. Once we have Xi and Ai 
in membrane 2, the subscripts of X and A are increased step by step. At any 
time we can apply the rules Xi YS and Ai —>■ Zx. But if we apply one of 
these rules first, then a trap symbol will be introduced. So, in order to get the 
correct simulation, we have to apply these two rules at the same step. In this 
way we can complete the simulation of a matrix of type 2 or 4. If the symbol 
A in membrane 2 is not the one in the matrix rrii : (X Y, A ^ x), then we 
cannot apply the rule Ai Zx and a trap symbol will be introduced in the skin 
membrane by the rule ^ f . After simulating a matrix Wj : (X ^ /, A ^ x) of 
type 4, / will create a membrane with label 3 so that if there is any nonterminal 
A G N 2 present in the skin membrane, then the computation never halts. 
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For simulating a matrix of type 3, X will create a membrane with label 3 
and transform into EX^^\ If there is a symbol present in the skin mem- 
brane, then with the help of a catalyst Cj, it will move into the membrane 3. 
In membrane 3, the object E will be replaced with E\ and at the same time 
the object X^^\ with the help of a catalyst Cj (if exists in membrane 3), will 
be replaced with a trap symbol (f) and the computation never halts. Otherwise, 
in the next step, the object Ex will be erased and thus dissolve membrane 3. 
In this way we can complete the simulation of a matrix m(. Thus, the equality 
Et{L{G)) = N{n) follows. (It should be noted that here even though we are 
using 3 membranes, the depth of the membrane system is always 2 only.) □ 

7 Conclusion 

In this paper we considered several variants of P systems with symbol-objects 
where the catalysts were allowed to move in and out of a membrane. We have 
proved the universality results for these variants of P systems with a very small 
number (2, 3 or 4 depending upon the case) of membranes. We feel that this is 
the optimum, i.e., to achieve universality we require this many membranes in 
each case. It still remains to prove the optimality rigorously. 
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Abstract. We introduce A-subset transforming systems as a general- 
ization of multiset transformation. A A-subset, which is a generalization 
of a multiset where “multiplicities” take values in a semiring, is consid- 
ered by S. Eilenberg. We construct an example of A-subset transforming 
system which models a chaotic discrete dynamical system. We show that 
for every basic reaction of multiset transformation we can construct a 
A-subset transforming system which expresses the multiset transforma- 
tion. We also show that for every phrase structure grammar there is a 
A-subset transforming system such that the system simulates derivations 
of the grammar. 



1 Introduction 

Recently a number of new computing models are proposed, such as quantum 
computing [14], DNA computing [12], membrane computing or P systems [3,9,10,11], 
and so on. These new models and the “traditional” models, such as Turing ma- 
chines, phrase structure grammars, term rewriting systems, cellular automata, 
etc, have a quite different appearance, but we can find a common feature of them: 
they all obey a discrete time development. We can say that computational sci- 
ence is a science of discrete dynamical systems; by contrast, the physical science 
have been described by continuous differential equations. 

In this paper we try to build a general framework of discrete dynamical sys- 
tems. We adopt A-subset [5] to express objects in discrete dynamical systems. 

A A-subset is a generalization of a multiset as it is a multiset where “multi- 
plicities” take values in a semiring. Among many varieties of multiset theory, 
A-subset has a firm theoretical background [4]. Our model has a set of rules 
which are pairs of a condition and an action over A-subsets. Adding an initial 
A-subset, we obtain a A-subset transforming system. 

A multiset transformation is a good model of a chemical reaction. How- 
ever, in such a situation when a molecule changes its conformation according 
to a concentration of other molecules, such as pH or calcium ion, rational or 
real “multiplicities” will be useful. Similarly, in quantum computing, the “mul- 
tiplicities” of quantum states are complex numbers. This is why we introduce 
non-integral multiplicities. 

After preliminaries describing semirings and A-subsets, we define A-subset 
transforming systems and give an example which models a chaotic discrete dy- 
namical system (Section 3). Then we mention the relation between A-subset 
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transforming systems and multiset transformation (Section 4). Finally, in Sec- 
tion 5, we show that iF-subset transforming systems encompass string rewriting 
systems: phrase structure grammars and L systems. Although some results in 
this paper are obtained by only simulating existing models with AT-subset trans- 
forming systems, we believe that A'-subset transforming systems open up new 
research vistas in computer science. Example 2 and Theorem 5 suggest the wide 
possibilities of AT-subset transforming systems. 

2 Preliminaries 

First we introduce the notion of semiring from [5]. A set K is said to be a semiring 
if K has two operations, addition and multiplication, AT is a commutative monoid 
with respect to addition, it is a monoid with respect to multiplication, and 
addition and multiplication are connected by the following equations 

x{y + z) = xy + xz, 

{x + y)z = xz + yz, 
xO = 0 = Ox, 

for every x,y, z G AT, where 0 is the unit element with respect to addition, -|- 
stands for addition, and omitted • stands for multiplication. The unit element 
with respect to multiplication is denoted by 1. Thus for every element x,y,z & K 
we have 



x + y = y + x, 

X + {y + z) = {x + y) + z, 
x + Q = x, 
x{yz) = {xy)z, 

\x = X = xl. 



Clearly, any ring is a semiring. We give a few examples of semirings which 
will appear in this paper. 

Example 1. The following sets are all semirings. 

1. B = {0, 1} with the operations: 

0-b0 = 0, 0-bl = l-b0=l, l-bl = l, 

00 = 01 = 10 = 0 , 11 = 1 . 

So 0 is the unit element for addition and 1 is the unit element for multipli- 
cation. Notice that B is different from GF(2)^. 

2. The set of all nonnegative integers U. 

3. The set of all real numbers H. 

GF(2) is the only field of two elements and satisfies 1 -t 1 = 0. 



1 
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A semiring K is called commutative if for every x,y € K we have xy = yx. 
Then we define Ai-subsets [5]. We assume that AT is a nontrivial semiring, 
i.e., 0 yf 1, or equivalently AT has at least two elements. We also assume that AT 
is commutative. Let A be a set. A K-subset A of A is a function 

A-.X^K. 

For every x G X the element A{x) of K is called the multiplicity with which x 
belongs to A or multiplicity of x in A. The union U and the intersection n of 
two A-subsets A and B oi X are defined by 

(A U B){x) = A{x) + B{x) and 
(A n B){x) = A{x)B{x). 

For every k G K the operation kA is defined by 

(kA){x) = kA{x). 

We note that every ;B-subset B of X corresponds a normal subset S' of A by the 
next equation. 

S = {a; G A I B{x) = 1}. 

A “normal” multiset is a collection in which elements may be duplicated, for 
example, 

{a, a, a, b, c, c}. 

The number of times an element occurs in a multiset is called its multiplicity. 
In this paper we denote a multiset in the form 

{multiplicity of a • a, multiplicity of & • 6, . . .}, 

so the multiset above is expressed as 



{3 • a, 1 • 6, 2 • c}. 



Then we have the next proposition. 

Proposition 1. For a set X, there exist functions p, and v such that p maps 
every H-subset A of X to a multiset p{A) over X, v maps every multiset M 
over X to an Jl-subset v{M) of X, and pv and vp are identities. 

Proof. The functions p and n are defined by 

m(^) = U ■ x} 

x^X /\A{x)'>0 



and 



v{M){x) 



the multiplicity of a; in M 

0 



Then the conclusions are obvious. 



if X G M 
otherwise 



□ 
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For every x,y € M we define x — y hy 

( X — y if X > y 
X — y = < ^ 

f 0 otherwise 

A semiring has some relations over its elements: every semiring has equality 
= and inequality and if a semiring is a ordered set, then it has two more binary 
relations, < and <. Let A : X ^ K he a AT-subset. A term over the AT-subset 
A is defined by: 

1. An element a G AT is a term. 

2. For every x G X, A{x) is a term. 

3. For a variable letter z, z and A(z) are terms. 

4. For terms ti and t 2 , ti + 12, t\t 2 , and (ti) are terms. 

5. Nothing else is a term. 

A predicate over the AT-subset A is defined by: 

1 . R{ti , . . . , is a predicate where R is an n-ary relation over K and ti, . . . ,tn 
are terms over A. 

2. For two predicates Pi and P 2 , PiV P 2 , P\ A P 2 , and ~^Pi are predicates. 

3. For a predicate P and a variable letter z, {3z G X){P), (Vz G X){P), 
(3z G K){P), and (Vz G K){P) are predicates. 

4. Nothing else is a predicate. 

A predicate is called closed if every variable is in the scope of a quantifier. A 
closed predicate is satisfiable on a A'-subset A if the following conditions hold. 

1. For (3z G X){P), there exists x G X such that P\a(z)^a{x) is true where 
P\a(z)^a(x) stands for the predicate in which every occurrence of A{z) in P 
is replaced with A{x). 

2. For (Vz G X){P), P\a{z)^a(x) is true for every x G X. 

3. Pi V P 2 is satisfiable if and only if Pi or P 2 is satisfiable. Pi A P 2 is satisfiable 
if and only if Pi and P 2 are satisfiable, and ^P is satisfiable if and only if P 
is not satisfiable. 

Unless otherwise stated, for every predicate P, we assume that Vi (P) and 3i (P) 
stand for (Vi G ®)(P) and (3i G W)(P), respectively. 

3 Definitions of if-Snbset Transforming Systems 

Here we give the definition of a A'-subset transforming system. 

Definition 1. A AT-subset transforming system is a 4~tuple G = {X, K, R, Aq) 
where X is a set, K is a semiring, R is a set of rules of the form 

(condition) : (action) 

in which condition is a closed predicate over K-subset X ^ K and action con- 
sists a set of formulas that give a new K-subset from the current K-subset, in 
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which the current K-suhset and the new K-suhset are denoted by A and A! , 
respectively. So an action consists of equations of the form 

A' {x\, . . . , Xn) = (a term over A) 

where Xi G X or Xi is a variable letter for 1 < i < n. // there are variable 
letters in an action which are quantified in the condition of the action, then the 
variable letters take values which make the condition satisfiable. We omit, from 
actions, the definition of multiplicities of elements of X which take the same 
multiplicities in the current K-subset. And Aq \ X ^ K is the initial K-subset. 

Let G = {X, K, R, Aq) be a iL-subset transforming system and let B ■. X ^ K 
be a iL-subset. Two rules ri and r 2 in R are said to be competing in the conditions 
on B if there are an element x G X and two variables z\ and Z2 occurring ri and 
C 2 , respectively, such that the conditions of ri and V2 are satisfiable for Z\ and 
Z2 to be assigned to x. Two rules ri and r2 are called overlapping in the actions 
if there is an element x G X such that ri has an equation 

A'{x) = (a term) 

and T 2 has an equation 

A' {x) = (another term). 

We note that it is generally undecidable whether two rules are competing in the 
conditions or not. Now we define derivations by a iL-subset transforming system. 

Definition 2. Let G = {X, K, R, Aq) be a K-subset transforming system and let 
B and B' : X ^ K be K -subsets. Then B' is said to be directly derived from B by 
G if there are rules ri , . . . , r„ such that their conditions are satisfiable on B and 
B' is obtained from B by the actions of the rules ri , . . . , r„ . If there are competing 
rules in the conditions on B, then one rule is selected nondeterministicly . If there 
are overlapping rules in the actions, then one rule is selected nondeterministicly. 
A sequence of K -subsets {Aq,Ai,...) is generated by G if and only if An is 
derived from An-i by G for n = 1,2,.... If there is no rule whose condition 
is satisfiable for a K-subset A„, then G derives no K-subset from A„ and the 
sequence is terminated at An. 

Now we give a simple but illustrative example. 

Example 2. Let G = ({y}, H, R, Af) be an H-subset transforming system where 
R consists of 



A{y) > 0 : A\v) = -2A{y) + 1, 
A{y) < 0 : A'{y) = 2A{y) + 1, 



and Ao{y) = xq for some xq G [—1,1]- We note that the derivation by G is 
deterministic because the two rules will never compete in the conditions. Then 
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the multiplicities of y in the sequence ( 2 I 0 , . . . , A„, . . .) generated by G give the 
trajectory of the discrete dynamical system 



^n+l — 2|x^| + 1, 



i.e., 



Xn = An{y). 



This is an example of chaotic dynamical systems (Example 6.2.1 of [8]). 



4 Multiset Transformation 

and ]N-Subset Transforming System 

The U-subset transforming systems are able to carry out multiset transforma- 
tions. For example, the sort program written in GAMMA [1,2,6] looks like: 

Example 3. Let G = {X, K, i?, Aq) be an H-subset transforming system where 

A = {1, . . . ,n} X {xi, . . . ,x„}, 1, . . . ,n G M, Xj G H 

Aq is an B-subset of X, and R consists of the rule 

3i3j (3x,y e ]R)(A((i, x))A((j, y)) >0A'i<jAx>y) : 

A'{{i,x)) = A((t,x)) - 1, A'{{j,y)) = A{{j,y)) - 1, 

4'((i,2/)) = A{{i,y)) + 1, A'{{j,x)) = A((j,x)) -b 1. 

Now obviously G generates a finite sequence (Aq, . . . , A^) such that 

i < j and x < y if Afe((i, x))Afc((j, y)) > 0. (1) 

Since there may be more than one combination of pairs (i,x) and (j, y) which 
make the condition of the rule satisfiable, G may generate more than one se- 
quences (Ao,...,Afe). It is easily proved that (1) holds for every A^ in every 
sequence. 

Example 3 is generalized to the basic reaction of GAMMA program. 

Theorem 1. Let X be a set and let 

G . Xi , . . . , Xn > A(xi , . . . , Xti) A(xi, . . . ,Xyi) 

be a basic reaction of multiset over X where xi, . . . , x„ are variables and R and 
A are of arity n [6J. Then there is an H-subset transforming system H such that 
G transform a multiset M to M' if and only if H generates Jl-subset M' from 
M. 
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Proof. Let H = {X, E, P, Af) be an E-subset transforming system where P 
consists of 

(3xi, ...,XnG X)R{xi, ...,x„) : B'{xi) = B{xi) + iy(A(xi, . . .,Xn))(xi) - 1, 

i = 1, . . . ,n 

B\y) = B{y) + v{A{xi , . . . , Xn)){,v) 
for y ^ {xi, . . . ,x„} 

and Ao = v{M[f) where Mg is the initial multiset for G and v is the function 
defined in Proposition 1. Then the conclusion follows immediately. □ 

We do not treat sequential and parallel composition operators of GAMMA 
[6]. But by Theorem 3 in Section 5, AT-subset transforming systems have com- 
putational universality. 

5 Phrase Structure Grammars, L Systems, 
and iGSubset Transforming Systems 

In this section we consider the relation between string rewriting systems and 
iL-subset transforming systems. We assume the reader is familiar with basics of 
phrase structure grammars (see [13]) and L systems (see [7]). 

Let A be a finite alphabet. A ,B-subset A of E x A is said to be linearizable 
if A satisfies 

1. For every f e E and a,b G S such that a^b vie have A{{i, a))A{{i, b)) = 0. 

2. For every i < j < k and every a,b,cG S vie have ~^{A{{i, a)) = lAA{{j, b)) = 
0 A A{{k, c)) = 1). 

For a linearizable B subset A, a mapping (/> : A ^ A* or A“ is defined by 

( Qi ■ ■ ■ Qj G S* if A((t, Oi)) = • • • = A((j, Oj)) = lA 
4>{A) = < A((fc, b)) = 0 for k < i,k > j 

[ Oi • • • Oj • • • G A“ if A((j, Qj)) = 1 for some i G E and every j >i 

Then the next theorem describes the fact that yB-subset transforming systems 
cover the phrase structure grammars. 

Theorem 2. Let G = {V, A, P, S) be a grammar. Then there exists a B-subset 
transforming system PI such that for every sentential form w generated by G 
there is a B-subset A o/ E x (A U A) generated by H which satisfies 

w = (j>{A). 

Proof. Let PI = (Ex (V U S), B, R, Ag) where Ag((l, S)) = 1, Ag((z, x)) = 0 for 
z yf 1 and X G (A U A), and R consists of the following rules: 

For every ai ■ • • Uk ^ bi ■ ■ ■ bi G P and I > 1 

3z (A((i, m)) = • • • = A((z -I- k, Ofc)) = 1 A oi • • • ^ 6i • • • 6;) : 
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A'{{i,h)) = --- = A'{{i + l,bi)) = l, 

A'{{i, ai)) = ■ ■ ■ = A' {{i + k, at)) = 0, 

A'{{i + 1 + j, c^+k+j)) = A{{i + k + j, Ci+k+j)) for j > I, and 
A'{{j, Cj)) = A{{j, Cj)) for j < i. 

For every ai ■ • • Uk ^ s G P, 



3i {A{i, oi)) = • • • = A{{i + k, ak)) = 1 A oi • • • Ofc ^ e) : 

A'{{i, ai)) = ■■■ = A'{{i + k, Uk)) = 0, 

A'((i + j - 1, c^+k+j)) = A{{i + k + j, Ci+k+i)) for j > 0, and 
A'{{j, Cj)) = A{{j, Cj)) for j < i. 

First we observe that every ,8-subset ^ of ® x (V^ U 27) generated by H is 
linearizable. Then the definition of 4> leads the conclusion. □ 

Since the above theorem says that iF-subset transforming systems can sim- 
ulate type 0 grammars, we have the following theorem. 

Theorem 3. The K-suhset transforming systems generate all recursively enu- 
merable languages. There is a K-subset transforming system generating a se- 
quence of K -subsets which is not recursively enumerable. 

Proof. The first assertion is a corollary of Theorem 2. Since a chaotic dynamical 
system shows quite different behaviour by any infinitesimal change in the ini- 
tial value, the different initial H-subset in Example 2 gives a different H-subset 
transforming system. So the cardinality of possible H-subset transforming sys- 
tems in Example 2 is the cardinality of continuum. Then the second assertion is 
true. □ 

We note that, by Church’s hypothesis, the class of effectively computable K- 
subset transforming systems must coincide with the class of Turing machines. 
Next we consider L systems. 

Theorem 4. Let G = (E,P,ff,w) be a (1, 1)L system where ff is the environ- 
mental marker not in E. Then there is a B-subset transforming system H such 
that for every u G E~^ derived by G, H generates the linearizable B-subset A of 
H X (if U {#}) satisfying 

M# = f){A). 

Proof. Let H = {X,B,R,Aq) be the ,B-subset transforming system where 

X = {-1} U If U {$} X H U M X (27 U {#}) U ({0, 1} x M) x (27 U {#}), 
Ao{{i, «i)) = 1, ^o((^ + 1) #)) = 1; smd Aq{x) = 0 for other x G X, 



where w = qq ■ • • ai, and R has the following rules: 

1. 3i{3a G 27 U {#})(A((*, a)) = 1) : Al'(((0, f), a)) = 1, A'{{z, a)) = 0. 
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2. Vi(Va G r U {#})(A((f, a)) = 0)A 

(3a, X G r)(yl(((0, 0), o))gI(((0, 1), x)) = 1 A (#, a,x) ^ bi ■ ■ ■ bk € P) : 
A'(((l,0),6i)) = ••• = A'(((l,fc- l),&fc)) = 1, 

Al'(((0,0),a)) = 0,A'(1) = 1)) = 1. 

2’. Vi(Va G 2: U {#})(A((f, a)) = 0)A 

(3a G r)(Gl(((0, 0), a))Al(((0, 1), #)) = 1 A (#, a, #) ^ 6i • • • G P) : 
A'(((l,0),6i)) = ••• = A'm,k-l),bk)) = 1, 

Al'(((0,0),a)) = 0,A'(-1) = l,A'm,k),#) = 1. 

3. 3j(Gl(j) = l)A3?(Al(($,0) = l)A 

(3a,x,y G i7)(A(((0, j - 1), x))^(((0, j), a))2l(((0, j + l),y)) = lA 
(x,a,y) ^ 6i •• - &fc G P) : 

A'(((l,/+ l),5i)) = ■■■ = A'm,l + k),bk)) = l,A'(((0,j),a)) = 0, 

A'{j + 1) = 1, A'(j) = 0, Al'(($, 0) = 0, A'(($, l + k)) = 1. 

4. 3j(A(j) = 1)A3UaI(($,0) = 1)A 

(3a, X G S){A{{{0,j - l),x))A(((0, j),a))A(((0, j + 1),#)) = lA 

(x,a,^) ^bi-'-bk&P). 

A'{{{l,l+l)M)) = --- = A'm,l + k),bk))=A'm,l + k+l),#)) = l, 
A'(((0, j), a)) = 0, A'{j) = 0, A'(($, /)) = 0, A'(-l) = 1. 

5. Al(-l) = 1 A 3z(3a G P U {#})(Al(((l, *), a)) = 1) : 

A'{{i,a)) = 1, A'(((l,f),a)) = 0. 

6. A(-l) = 1 A Vz(Va GPU {#})(A(((1, i), a)) = 0) : A'{-1) = 0. 

Now we show that for every P-subset Glof]Nx(PU{#}) satisfying = (j){A) 
for some u G P^, u v if and only if there exists a P-subset B which is 
generated from ^ by P and 4>{B) = v^. Let u = ag ■ ■ ■ a„_i where Oi G P, 
z = 0, . . . , n — 1, let Al((0, oo)) = ■ ■ ■ = A((n — 1, a„_i)) = A((n, #)) = 1, and let 
A((j, a)) = 0 for j < 0 or j > n. Then by iterating the rule 1 n + 1 times, we 
have P-subset Ai such that 

Ai(((0,0),ao)) = •• • = 2li(((0,n- l),a„_i)) = xli(((0, n), #) = 1. 

We note that rule 2 cannot be iterated until rule 1 is iterated n + 1 times by the 
first condition of rule 2: 

Vz(Va GPU {#})(A((i, a)) = 0). 

Next rules 2, 3, and 4 (or 2’) simulate the derivation of G from left to right. 
After the rule 4 (or 2’) is iterated, we have P-subset A 2 satisfying 

A2(((1,0),6o)) = ••• = A2(((l,m),6,„)) = A2(((1 ,to+ 1),#)) = 1, 

A 2 (- 1 ), and bg- ■■b^ = v. 

Finally rules 5 and 6 generate the desired P-subset B. Then it is proved that H 
generates P-subset A of K x (P U {#}) if only if G generates u G P"*" such that 

M# = HA). 

If G generates e, then H generates P-subset A^ and Ag derives nothing where 
Ae(((0, 0), #)) = 1, As{x) = 0 for other x G A. 
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Since all L systems generate e from e, this is also correct. □ 

The ,8-subset transforming system constructed in the above proof is quite 
unefficient. It simulates one step derivation of an L system by means of many 
steps. We should find it'-subset transforming systems which can generate strings 
in parallel. 

But by considering an B-subset of if* x B, we can measure the multiplicity 
of a word, that is, the total number of different derivations of a word in an L 
system. 

Theorem 5. Let G = (if,P, #,w) be an L system. Then there is an H-subset 
transforming system H such that for every word u derived by G in i steps H 
generates an H-subset A o/if* x B and that A{{u,i)) gives the multiplicity ofu. 

Proof. Let H = {S* x B, B,i?,xlo) where 4lo((a;,i)) = 0 for 

X w or i 0 and R consists of the following rule 

3i(3u S if*)(il((M, i)) > 0 A (3v S TJ*){u v)) : 

A'{{v, i + 1)) = A{{v, i + 1)) + A{{u, i)), A'((u, i)) = 0. 

Then obviously A((u,i)) gives the multiplicity of u derived by G in i steps. □ 
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Abstract. There are presented deterministic P systems with active 
membranes which are able to solve the Integer Factorization Problem 
in a polynomial time, which is the main result of the paper. There is 
introduced a class of programs for the description and correct imple- 
mentation of algorithms of elementary number theory in nonstandard 
computing systems, especially in P systems with active membranes. By 
using some of these programs there is achieved the main result. 



Introduction 

Recently in the area of membrane computing, a branch of molecular comput- 
ing, the known NP-complete SAT problem was successfully attacked by mem- 
brane systems (also called P systems) with active membranes, which ae able to 
solve this problem in a linear time by massively parallel computations. These 
P systems with active membranes are described in the pioneering paper Paun 
(2000a). Thus the old Integer Factorization Problem belonging to the class of 
NP-problems can be also solved in a polynomial time by P systems with active 
membranes. One can get this kind of a solution by the reduction of Integer Fac- 
torization Problem to SAT problem in order to apply those P systems which 
solve SAT problem in a linear time. 

We recall that the Integer Factorization Problem, formulated as a problem of 
finding a non-trivial divisor of a given natural number, is an important problem 
of number theory and public-key cryptography, cf. Menezes et al. (1996) and 
Koblitz (1998). 

In the present paper we present deterministic P systems with active mem- 
branes which solve Integer Factorization Problem in the time O(ln^n)^ for a 
given natural number n to be factored. We do not apply the cumbersome and 
time consuming reduction of Integer Factorization Problem to SAT problem (see 
the proof of Cook’s Theorem in Papadimitriou (1994), where there is given a gen- 
eral method of reduction of NP-problems to SAT problem) . There is only used 

^ For the O notation see Papidimitriou (1994) or Koblitz (1998). 
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a “school” algorithm for computing the remainder of the division of two binary 
presented natural numbers. We achieve the result by writing this algorithm in 
a form of a program belonging to a class of simple programs called liberal loop 
programs and related to loop programs defined in Meyer and Ritchie (1967). The 
liberal loop programs, introduced in the present paper, are aimed to describe al- 
gorithms of elementary number theory in terms of bit operations (cf. Koblitz 
(1998)) for correct and immediate implementation of these algorithms in non- 
standard computing systems, especially in P systems with active membranes. 

The methods presented in the paper can be also applied to define P systems 
with active membranes which solve Discrete Logarithm Problems (cf. Koblitz 
(1998)) in a polynomial time. 

In Section 1 of the paper we introduce liberal loop programs. Then in Sec- 
tion 2 we describe P systems with active membranes, called briefly P systems in 
the paper, and give a precise definition of processes generated by P systems. The 
processes generated by P systems are counterparts of processes of computation. 
Deterministic P systems solving Integer Factorization Problem are described in 
Section 3. In the Appendix we present the rules for an implementation of liberal 
loop programs in P systems. 

For all unexplained terms of complexity of computation we refer the reader 
to Papadimitriou (1994). 
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1 Liberal Loop Programs 

Liberal loop programs, briefly called ^-programs and related to loop programs 
defined in Meyer and Ritchie (1967), form a restricted class of programs for 
manipulating binary strings^ stored in registers. There is no limit on the size of 
registers and every register contains exactly one binary string. 

An Lprogram is a finite sequence of instructions of five types: 

(1) A := r, 

(2) X := A 

(3) X :=op(...,X,Y,Z,...) for op G BOP, 

(4) LOOP|A|, 

(5) END, 

where . . . , X,Y, Z, . . . are names of registers, and the symbols op G BOP^ are 
symbols of bit operations (cf. Koblitz (1998)). From now on, names of registers 

^ A binary string is a finite sequence of digits 0 and 1. 

® The set BOP may be specified separately for every Z-program according to the prac- 
tical need of particular bit operations to be used. 
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are denoted by capital Latin letters. The occurrences of instructions LOOP |X| 
and END in an Lprogram are matching like occurrences of parentheses “(’’and 
respectively in a well formed propositional formula, e.g., as in formula 

{{A A B) y ^{C)) A {B A C). 

An Lprogram V is realized by the execution of instructions in the sequential 
order in which they appear in V. The instructions are executed as follows: 

— according to X := Y the current content of register Y is transferred to 
register X, 

— X := A means to clear register X or equivalently to put the empty string A 
in register X, 

— X := op (. . . ,X,Y, Z, . . .) means to put the result (value) of a bit operation 
op determined by the current contents of registers . . . ,X,Y,Z,. . .in register 
A, 

— for an Z-program of the form 

(*) LOOP |A|-P END 

if the first occurrence of LOOP |A| and the last occurrence of END in this 
program are matching, then the first occurrence of LOOP |A| means to re- 
alize |a;|-times the piece V, where |x| is the length of string contained in 
register X upon entering the loop, 

~ if an /-program V' of the form (*) appears in another /-program V" and 
the first occurrence of LOOP |A| and the last occurrence of END in V' are 
matching, then the corresponding occurrence of LOOP |A| in V” is executed 
analogously as above. 

All basic conventions concerning /-programs are the same as the conventions 
for loop programs which are adopted in Meyer and Ritchie (1967), except the 
execution of instructions X := op(. . . , A, F, Z, . . .) and LOOP|A|. Thus, for 
more explanations we refer the reader to Meyer and Ritchie (1967) or Brainerd 
and Landweber (1974). 

For an /-program V we write RGR(T^) to denote the set of names of registers 
which occur in V. By input data T> for a program V we mean a family of binary 
strings sx {X G I) for a set / C RGR(T^) of input registers such that sx is 
contained in register X for every A G /, respectively, when realization of V 
starts. We assume that when realization of V starts, the non-input registers, i.e. 
belonging to RGR(T^) — I, contain the empty string. 

By the time of realization of a program V for input data V, written 
TIME(7^,I?), we mean the total number of executions of single instructions oc- 
curring in V during the realization of V for input V. By the space of realization 
of a program V for input data T>, written SPAGE(7^, I?), we mean the length of 
the longest string contained in registers of RGR(T^) during the realization of V 
for input V. 

For an /-program V we denote by \V\ the length of 7^ as a sequence of 
instructions. 
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We present now some /-programs describing certain simple algorithms of 
elementary number theory. We identify natural numbers with their binary pre- 
sentations if there is no risk of confusion. A finite string of digit 0 may appear 
on the left in a binary presentation of a natural number. We say that meaning- 
less zeros do not occur in a binary presentation of a natural number given by a 
binary string s if digit 0 occurs as the first symbol of s whenever s is of length 1 
and s presents number 0. 

The /-program 



W :=A 
Z -=A 
LOOP |a:| 

W-.= ^{X,Y,Z) 

Z :=(B{X,Y,Z) 

X:=i{X) 

Y:=m 

END 

W :=W\ Z 

computes the sum of two binary presented natural numbers m and n contained 
in the input registers X and Y respectively, where \m\ > |n| and \m\, \n\ are the 
lengths of binary strings presenting m and n, respectively. The symbols (t, 0, 

I, ! are elements of BOP. The binary presented sum m 0 n is contained in the 
output register W after the realization of the above program. The realization of 
the above program coincides with the “school” computation of the sum m 0 n of 
binary presented natural numbers m and n, where register Z is aimed to store 
digit 1 carried during the computation and where execution of new instructions 
is defined in the following way. According to instruction W := i{X, Y, Z) one 
writes digit 1 on the left end of the current content of register W if all registers 
X, Y, Z currently contain those strings whose last symbol is digit 1 or exactly 
one of the registers X,Y,Z contains that string whose last symbol is digit 1, 
otherwise one writes digit 0 on the left and of the current content of W. The 
instruction Z := 0(Ai, Y, Z) is executed in such a way that one puts one element 
string “1” in register Z if at least two of the registers X, Y, Z contain those strings 
whose last symbol is 1, otherwise one puts empty string in register Z. According 
to the instructions X := J,(AT), Y := J,(F) one erases the last symbol of the 
strings currently contained in registers X and Y. When registers X, Y contain 
the empty string, the execution of these instructions leaves the empty string 
contained in them. According to instruction W := W \ Z one writes on the left 
end of the current content of register W the last symbol of that string which is 
currently contained in register Z. When register Z contains the empty string, the 
content of register W remains unchanged after the execution ofW := W I Z. We 
use expression V[W := X -\-Y] as an abbreviation of the program written above. 
The realization of the /-program V[W := X -\-Y] does not save the input data 
contained in the registers X and Y because of the execution of instructions X := 

J, (AT) and Y := i(P). One avoids this defect by writing a new /-program which 
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is juxtaposition of instructions P := X, Q := Y, and V[W := P + Q] given by 

P:= X 
Q:=Y 

V[W :=P + Q] 

where P[W := P + Q] is the /-program resulting from the replacement of occur- 
rences of X,Y in V[W := X + Y] by occurrences of P and Q, respectively. We 
denote the improved program by V[P := X,Q := Y,W := P + Q]. 

The /-program 



W:=A 
Z := A 
LOOP |X| 

W:=UX,Y,Z) 

Z:=e (X,Y,Z) 

X:=l(X) 

r:=i(r) 

END 
LOOP |Z| 

W:=A 

END 

computes a function described by 

_ J the binary presented subtraction m — n, if m> n, 

\ the empty string A, otherwise 

for two binary presented natural numbers m and n contained in the input reg- 
isters X and Y, respectively, where \m\ > |n|, and \m\, |n| are the lengths of 
binary presentations of m and n, respectively. The symbols U, 0, J, are elements 
of BOP, where the meaning of jj, and J, has been already explained in the case of 
the program for computing the sum m + n. 

The value m-7n is contained after realization in the output register W. The 
realization of the above program coincides with the “school” computation of 
subtraction m — n of binary presented natural numbers m and n, where register 
Z is aimed to store digit 1 borrowed during the computation. The execution of 
instruction Z := Q{X,Y, Z) provides correct storing of borrowed 1 in register 
Z and is similar to the execution of instruction Z := (B(X,Y, Z). We use the 
expression V[W := X-^Y] as an abbreviation of the above program. We improve 
the program V[W := X -h-Y] by writing new /-program 

J := X 
K :=Y 

V[W := J^K] 

whose realization saves the input data contained in registers X and Y. The 
improved program is denoted by V[J := X,K := Y,W := I ^ K], 
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We use the following auxiliary ^-program 

M -=Y 
LOOP|i?| 

END 
T ■= M 
T :=AST 

which verifies whether \y\ < |r| for two binary strings y and r contained in the 
input registers Y and R, respectively, where \y\ and |r| denote the lengths of y 
and r, respectively. The output register T contains after realization one-element 
string “ 0 ” if |y| < |r|, otherwise T contains the empty string. The execution of 
instruction T := A 0 T means to put one-element string “ 0 ” in register T if 
register T contains currently the empty string, otherwise to clear register T. 

We write V\T := |F| < |i?|] to denote the above auxiliary program. We write 
also V[T := yl 0 M] to denote the two-instruction program 

T ■= M 
T := AST 

which verifies whether register M contains the empty string, where the execution 
of instruction T := T 0 T is defined as above. 

We describe now some useful constructions of new /-programs from old l- 
programs. If V is an /-program, then the juxtaposition of instruction LOOP |X|, 
the program V, and instruction END, written 

LOOP|X| 

V 
END 

is an /-program. For two programs V and V' their juxtaposition, written 

V 

V 



is in an /-program. 

Hence if the realization of an /-program V verifies a property (P(T>) of input 
data T> such that after the realization the output register T contains a string 
of length 1 when (P(T>) holds, otherwise T contains empty string, then for two 
/-programs V' and V" the juxtaposition given by 

V 

LOOP \T\ 

V 

END 

T ■= AST 

LOOP \T\ 

-p" 

END 
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is an ^program whose realization means the computation according to the flow 
diagram 




where the name T does not appear on the left hand side of the symbol := in 
any instruction occurring in V' . The execution of T := A0T has been already 
defined. 

By using these constructions we can introduce more complex programs. 

The ^-program written in Fig. 1 computes the remainder rem(m/n) of the 
division of m by n of two binary presented natural numbers m and n contained 
in the input registers X and Y respectively, where m > n > 0 and meaningless 
zeros do not occur in the binary presentations of m and n. The result rem(m/n) 
written as a binary string is contained in the output register R after the realiza- 



R-.= A 
LOOP|y| 

R~RkX 
X ~ T(^) 

END 
U --X 
U :=USzO 
LOOP|t/| 

V[T:=\Y\<\R\] 

LOOP \T\ 

V[J := R,K --YW := J ^ K] 
V[E :=ASW] 

E--ASE 

LOOP|E| 

R--W 

END 

END 

V[S := T©X] 

S ■- ASS 
LOOP I S'! 

LOOP|7?| 

R~0^ R 
END 

R — R&iX 
X ~ux) 

END 

END 



Fig. 1. 
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tion of the program. The realization of the program in Fig. 1 coincides with the 
“school” computation of remainder of division of two natural numbers, where 
the execution of new instructions is defined in the following way. According to in- 
struction R := REzX one writes the first symbol of the string currently contained 
in register X on the right and of the string currently contained in register R. 
If register X contains the empty string, then the content of register R remains 
unchanged after the execution of this instruction. The execution of instruction 
U := U Sz 0 means to write digit 0 on the right end of the current context of 
register U. 

According to instruction X := t(-^) one erases the first symbol from the 
current content of register X. If X contains the empty string, then its content 
remains unchanged after the execution of this instruction. 

According to instruction i? := 0 "f R one erases the first symbol from the 
current content of register R if this symbol is digit 0, otherwise the content of 
register remains unchanged, also when R contains the empty string. The above 
new instructions are described in terms of bit operations. The program written 
in Fig. 1 is denoted by V[R := veu\{X/Y)]. 

The program written in Fig. 2 checks whether a binary string s presents a 
non-trivial divisor of a binary presented natural number m for m and s contained 
in the input registers X and Y , respectively, where 1 < |s| < \m\ and meaningless 
zeros do not occur in m. If s does not present a non-trivial divisor of m, then the 
output register A contains the empty string after the realization of the program. 

LOOP|y| 

y := 0 T y 

END 
B — Y 
B ■- \B\ < 1 
LOOP|B| 

A — A 
END 

B — AS B 
LOOP|B| 

V[R ■- rem(A/y)] 

LOOP|i?| 
i? := 0 T i? 

END 

r.-ASR 

LOOP|i?| 

A--Y 

END 

R — ASR 

LOOP|i?| 

A--A 

END 

END 



Fig. 2. 
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If s presents a non-trivial divisor of m, then after the realization of the program 
the output register A contains that non-empty binary string which is the result 
of erasing of meaningless zeros in s. 

New instruction B := \B\ < I means to put one element string “0” in register 
BUB contains currently a string of length no greater than 1, otherwise to clear 
register B. 

The program written in Fig. 2 is denoted by Vfct- 

Proposition. Let input data V for program V[R := re\n{X/Y)] he given by 
binary strings sx and sy which present natural numbers m and n, respectively 
such that m > n > 0 and meaningless zeros do not occur in both sx and sy . 
Then we have that 



TIME(-P[i? := rem(X/r)],X>) < C • |sx| • I 
and SFACF,lv[R := rem(X/r)],X>) = |sx| 

for some constant natural number C > 1, where |sjc| and |sv| denote the lengths 
of strings sx and sy , respectively. 

Proof. One proves this proposition by a careful inspection of the realization of 
the piece of V[R := rem{X /Y)] between the occurrence of instruction LOOP \U\ 
and the occurrence of END matching with the occurrence of LOOP |17|. 

Corollary. Let input data V for program Vfct be given by binary strings sx 
and sy such that 1 < |sy| < |sx| and sxpresents a natural number such that 
meaningless zeros do not occur in sx- Then we have that 

TIME(Pfct,I?) < C • |sx| • |sv| and SPACE (Pfet, P) = |sx|. 

for some constant natural number C > 1, where |sx| and |sv| denote the lengths 
of Sx and sy, respectively. 

Proof. This is an immediate consequence of the above proposition. 



2 P Systems 



We recall that for a set A a multiset of elements of A is a function f : A ^ N 
defined on A and valued in the set N of natural numbers with 0. For two multisets 
f,g:A^Nwe define their sum f + g and difference / -^ g to be the multisets 
of elements of A which are given by 

(/ + g){a) = /(a) -h g(a), (/ - g){a) = /(o) - g{a) for all o G A, 



where m — n = m — n for m > n, otherwise m — n = 0. 

For an element o of A we define a multiset {a) : A ^ N hy 



{a) {a') 



1 if o' = a, 

0 otherwise. 
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If tu is a finite sequence (string) oi . . . a„ of elements of A, we define the multiset 
(w) or (oi . . . a„) by 

n 

(w) = ^(Oi) = (oi) + . . . + {an). 
i=l 

We use the symbol O to denote the empty multiset, i.e. that constant function 
defined on A whose value is equal to 0. For two multisets /, g we write f < g A 
/(a) < 5(a) for all a G A. 

We recall according to Paun (2000a), (2000b) that a membrane system S is 
given by the following data: 

— a finite non-empty set B of balls of finite diameters > 0 (in Euclidean space 
E" for n > 1) such that the frontiers"^ of the balls contained in B are pairwise 
disjoint sets, and there exists the greatest ball in ® with respect to the 
inclusion relation C; the set B is called the underlying set of balls of S. 

— three labelling functions / : B ^ L, e : B ^ {-,0,-1-}, M : B ^ , where 

L is the set of labels of S, the function e assigns the electric charge e{b) to 
every ball & € B, O is the set of objects of S and M is a function valued in 
the set of multisets of elements of O. 

For a ball & G B the multiset M{b) describes the numbers of copies of individual 
elements of O contained in b in the space (region) between the frontier of b and 
the frontiers of those balls in B which are immediate subsets of b. 

For two balls bi, 62 of the underlying set B of a membrane system we write 
61 A 62 iff bi is an immediate subset of 62, i.e., bi ^ 62, and there is no any other 
ball 6' G B such that b\ <^b' <^b 2 - 

The membrane systems are transformed into new membrane systems by ap- 
plications of the evolution rules, described in Paun (2000a), (2000b), to the balls 
and objects of systems. We consider in the paper the following five types of 
evolution rules which are written according to the conventions adopted in Paun 
(2000a): 

( 1 ) [hU^wff, 

( 2 ) - [hc]u [hd\t, 

(3) [h<Ah c, 

l'4'l [ [ 1 - [ 1+ 10 ^ r r 10 10 r r 10 10 

V / ihQihx L/12 ihQihi ihi-ihQ \-hQih2 ih 2 ^hQ^ 

(5) [h<Al [h ]hC^ 

where u, w are finite sequences (strings) of objects of a given membrane system, 
a, o! G {— , 0, -I-}, a, c, d are objects of a given membrane system, and h, ho, hi, 
ft-2 are elements of the set of labels of a given membrane system. 

The applications of the evolution rules of the above five types are described 
in the following way for a membrane system S given by B, I, e, M. 

^ The frontier of a ball {x G F" | d{s,x) < r} is the sphere {x G E" \ d{s,x) = r}, 
where d{s, x) is the distance between the center s and *. 
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Ad (1). A rule of this type applies to those balls 6 G B for which l{b) = h, 
e(b) = a, and (u) < M{b). According to this rule there is only changed the 
content M{b) of & into a new M'{b) given by 

M'{b) = (M{b) - {u)) + {w). 

Ad (2). A rule of this type applies to those innermost® balls 6 G B for which 
l{b) = h, e{b) = 0, and (a) < M{b). According to this rule a ball b is divided (or 
replaced by) into new innermost disjoint balls b~ , b~^ of opposite electric charges 
e{b~) = e(6+) = +, and l{b~) = l{b~^) = h, M{b~) = {M{b) — (a)) + (c), 

M{b~^) = (M{b) — (a)) + (d). Thus the old ball b is deleted from the system. 
The new balls b~ , b~^ are immediate subsets of that ball 6' G B for which b <b' 
before division of b. 

Ad (3). A rule of this type applies to those innermost balls 6 G B for which 
l{b) = h, e{b) = a, and (a) < M{b). According to this rule a ball b is deleted 
from the system. There is also changed the content M{b') of that ball 5' G B for 
which b <b' before deletion of 6. A new content M'{b') is given by 

M\b') = M{b') + ((M(5) ^ (a)) + (c)). 

Ad (4). A rule of this type applies to those three different balls 6i, 62, € B 

for which we have that 

— b\ is an immediate subset of the greatest ball 60 G B, 62 ^ &i, 63 b\ and 

there is no any other ball 6 G B such that b <b\, 

— e(6i) = 0, e(&2) = — , e(&3) = +, l{bi) = ho, ^(^2) = hi, l{b3) = /i2- 

According to this rule the ball bi is divided (or replaced by) into new disjoint 
balls b' , b" such that 

— b2 <b' ^bo and 63 ^ b" -< bo, 

— l{b') = l{b”) = ho, e{b') = e{b") = 0, and M{b') = M{b") = M{bi). 

Thus the old ball b\ is deleted from the system but the other balls of B, including 
the balls b bi, remain unchanged except the electric charges of the balls b2, 
bo are changed into new charges e'(bo) = e'{b2) = 0. The frontiers of balls in 
(B — {61}) U {b',b”} are pairwise disjoint. The rules of type (4) are formulated 
in a more general way in Paun (2000a). 

Ad (5). A rule of this type applies to those balls 6 G B for which l{b) = h, 
e{b) = a, and (a) < M{b). According to this rule one copy of the object a is 
deleted from a ball b whose electric charge is changed into a' . One copy of the 
object c is placed in that ball b' for which b ^ b' . If 6 is the greatest ball in B, 
then one copy of c is sent out of the system. 

By a place of application of an evolution rule to a membrane system S we 
mean an ordered pair p = {rp,Xp), where Vp is an expression written in one of 
the forms given in (l)-(5) and Xp is described in the following way: 

— if Tp is in one of the forms in (l)-(3),(5) then Xp is a ball of S for which Xp 
can be applied as it has been already described in Ad (l)~Ad (3), Ad (5), 
respectively, 

Innermost is meant as minimal with respect to C. 



5 
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— if Tp is in the form in (4), then Xp is an ordered triple (6i, 62, 63) of balls of 
S for which Vp can be applied as it has been described in Ad (4). 



A set V of places of application of evolution rules to a membrane system S is 
called unambiguous if for all places p,p' G V the simultaneous applications of 
Up and Up! to Xp and Xp>, respectively do not lead to a case when an object in 
a given ball or a ball itself is a subject of two different changes resulting from 
applications of the rules. The simultaneous applications of the rules of type (1) 
and of type (4) are always unambiguous. 

A P system is defined in Paun (2000a) to be an ordered pair II = {S,TZ), 
where 5 is a membrane system and 7^ is a set of expression like those given 
in (l)-(5) which present the evolution rules. The P systems generate processes 
of evolution. We consider in the paper only finite processes generated by these 
systems. A finite process of evolution generated by system II = {S, TV) is a 
pair of sequences (5 i)(Tq and of membrane systems and non-empty sets 

of places of application, respectively, written 



5n 



■Pi, 



5i 



S2 ■ ■ ■ Sn-l 



S„ 



such that the following conditions hold: 



(A) ~Pi is a maximal (with respect to C) unambiguous set of places of applica- 
tion of evolution rules to such that {rp j p G Vi} C 7Z for 1 < i < n, 

(F2) So is S and for 1 < z < n the system Si is the result of simultaneous 
applications of Vp to Xp in Si-i for all p G Vi, respectively, 

(P3) there is no any rule in V. which can be applied to which means that 
process of evolution stops after n steps. 



A P system II = {S,TZ) is called deterministic if it generates a process given 
by {Si)'f^Q, (T’i)?=i such that the sets Vi of places of application are the greatest 
(with respect to C) unambiguous sets of places of application of evolution rules 
to such that {vp \ p G Vi} C V. 

One can also define a weak deterministic P system to be that system which 
generates only finite processes and these processes stop after the same number 
of steps reaching the same (up to isomorphism) membrane system in the last 
step. 

The P systems used in Paun (2000a) to solve SAT-problem are weak deter- 
ministic systems. 



3 P Systems for Solving 

the Integer Factorization Problem 

We introduce in this section P systems which solve the Integer Factorization 
Problem in time O(ln^n) for a natural number n > 2 to be factored. We apply 
an ^-program Vtct to define these P systems. 

For a natural number n > 2 presented by a binary string a\ . . .ak of length 
k = 1 + [log2 n] (meaningless zeros do not occur in this string) we define a P 
system 77^”^ = such that 
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— the membrane system 5^"^ is such that its underlying set B of balls contains 

exactly three balls 62 £ £ bo, the set L of labels of 5^”^ is the set {0, 1, 2}, 

and l{bi) = i, e(&i) = 0 for all i G {0, 1, 2}, 

— the set O of objects of 5^”^ is the set of ordered triples (x,y,z), written 
{x)y, such that 

• X £ { 0 , 1, >} U {0, 1, . . . , |7^fct| + 1}, one can verify that |7^fct| = 68, 

. y G RGR(T^fct) U {-@, +@, T}, 

• zG{0,l,...,fc+l}U{_L,@}, 

where digits 0 , 1, and auxiliary symbols — +@, _L, > are different from 
natural numbers and names of registers in RGR(T^fct), 

— the labelling function M : B ^ valued in the set of multisets is 
given by 

M{bo) = M(bi) = O, 

k 

M(62) = (E(('^of)) + (w^+i) + 

+((>)f-i)+ E (wf)- 

Z£l- 

where the binary string cti . . . cr^ presents natural number n and I~ = 
RGR(T^fct) — {X,Y} for input registers X and Y of ^-program Viet, 

— the set of rules contains the rules given by the following schemes: 

(^1) - [2 W-®] 2 -[ 2(>)n^ for 1 < * < A: - 1, 

(S'2) [2(i>)*^i(t>)r® ^ Wf-i(0 )f]^> for 1 < i < fc - 1 where 0 is digit, 
{S 3 ) [2(>),^i(o)+® ^ (»)*® i(l)r]2 . for 1 < i < A: - 1, where 1 is digit, 
(^4 ) [i[ 2]2-[2 ]J]?-[i[ 2 ]^]?[i[ 2]^]?, 

(•S's) bWf]” ^ (»)fc (1)@]2> where 1 is number, 

(P) the set of rules determined by ^-program Vtct and described in Ap- 
pendix by the clauses (1)-(16), 

(Fi) bd'PfctI + l)@(i>)/ ^ ]° for ‘^ < j < k, where A is output 

register of Vfet, 

(F 2 ) [2 (l)i]^-(l)i, 

(F 3 ) Kl)i]°-[.]+(l)ifor*G{0,l}. 

There is no any other rule in pfo) than that described by the above schemes. 

Theorem. For every natural number n > 2 the P system 77^^ = 
is a deterministic system and the process of evolution generated by it stops after 
the number of steps no greater than C-ln^ n for some constant C. Moreover, the 
process generated by reaches in the last step that membrane system S whose 
innermost halls distinguished by electric charge + assigned to them contain all 
non-trivial divisors ofn if they exist and also greatest hall ofS has electric charge 
+ in this case, otherwise its charge is 0. 



fc -2 






3=0 
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Proof. We outline a proof of the theorem by describing a process of evolution 
generated by 7 ^^"^). We write Si to denote that membrane system 

which results from transformation of the system 5 *-"^ after i steps of evolution 
realized by simultaneous applications of the rules contained in We write 

Bi and Mi to denote the underlying set of balls of Si and the labelling functions 
of Si valued in the set N'^ of multisets, respectively. 

After 2 -(fc— 1 ) steps the membrane system 5 ^"^ is transformed into membrane 
system 52.(fc-i) whose underlying set B2.(fc_i) of balls contains 2^ + 1 balls &o 2 
bj 2 bj (1 < J < The transformation of 5 ^"^ into 52.(fe_i) is realized by 

applications of the rules given by schemes (S'i)-(S'4) such that in an odd step 
some rule in the form (S'!) is applied and in an even step the rules in the forms 
(S'2)-(S'4) are applied simultaneously; see figure below 




In all steps i > 2 ■ {k — 1 ) the underlying sets B^ of balls of Si remain 
unchanged (i.e., B^ = B2.(fc-i) for i > 2 • (/c — 1)) until the object (1)^ appears 
in some innermost balls. 

There is a one to one correspondence between 2 ^~^ binary strings of length 
k — 1 and 2*“^ innermost balls in B2.fc_i which is determined in consequence of 
applying (S'5) by the inequality: 

(*) (W)r • • • K-i)LiWi) < m2.,_i(6") 

for an innermost ball and a binary string a[ . . . cr^_^ of length k — 1 . We have 
also that 

M2.k-i{bo) = M2.k-i{b'j) = O for every 1 < j < 2'=“2 

In each step i > 2 ■ k — 1 for every innermost ball b'' the objects contained in 
bj evolve according the rules described in (P) such that the process of evolution 
simulates the realization of the ^-program Vfct for the input data given by strings 

(**) sx = cTi...crfc and sy = cr[ . . . 

where sx presents the number n and inequality (*) holds for sy. The contents of 
registers Z in RGR(Pfct) processed during the realization of Vfct are represented 
by multisets 

for binary strings a'{ . . . contained in Z. 

The objects (j)@ for 1 < j < |Pfct| are used to control the simulation of the 
execution of single instructions in the order in which they appear in Viet- 

The multisets < Mi{b”) are aimed to control the 

simulation of the execution of the instruction LOOP jZj appearing as an m-th 
element of Pfct (see clause (6) in the Appendix) for a register Z € RGR(Pfct) 
and 1 < TO < |Pfct| — 1 - 
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Thus in each step i > 2-fc— 1 the process of evolution simultaneously simulates 
2^“^ realizations of /-program Viet for 2^“^ different input data given by strings 
sx and sy described in (**), where sx is a constant string presenting the number 
n and sy is an arbitrary binary string of length k — 1. 

Hence the process of evolution stops after a number of steps no greater than 
3C • In^ n, see the Corollary from Section 1. 

The rules given by schemes {Fi), {F 2 ), (T3) are used to assign the electric 
charge -I- to those balls which contain nontrivial divisors of number n if they 
exist. By using rules given by scheme (F3) there is assigned the electric charge -I- 
to the greatest ball if there exist nontrivial divisors of n, otherwise the electric 
charge of this ball remains 0. 



Concluding Remarks. Reading the Theorem and its proof one finds that the 
Integer Factorization Problem can be solved in a polynomial time by using P 
systems but with an exponential growth of the number of processors, where pro- 
cessors are meant here as balls of membrane systems. We conjecture that by 
combining the methods outlined in Lenstra and Lenstra (1993) and an idea of 
probabilistic P systems one could obtain probabilistic P systems which can solve 
the Integer Factorization Problem in a polynomial time and with a subexponen- 
tial growth of the number of processors. 

One can modify P systems Tjfo) (n > 2) to obtain new P systems which 
besides computing non-trivial divisors of n select also those divisors which are 
prime numbers. A modification can be done by using some new rules related to 
the rules of type (4) described in Section 2. 



4 Appendix 



Let V be an /-program written by using only instructions introduced in Section 1 
and let T> be input data for V such that SPACE(7^, T>) <k for a natural number 
k > \. We define the set 0-pjy of objects determined by V and T> in the same 
way as the set of objects of the membrane system 5^^ in Section 3, except Vtct 
is replaced by V. The set of rules determined by occurrences of instructions in 
V and by V, written TZ-p^v, is defined by the following clauses by using 
We use here capital Latin letters as variables ranging over RGR('P). 



(1) if 1 < j < \V\,X,Y G RGR(T^), and instruction A := P is j-th element of 
V, then the rules given by the following schemes belong to TZp^p : 

(si.l) [2(j)@(>)^+l(l>)^+l ^ (>)n+l(>)^+lWmW„]^ for 0 < TO < /c, 0 < 
n < k, where Um and are strings of objects belonging to Op;p 
such that 



'Om 

Wn 



(j)^ if TO = 0, 

U)o ■ ■ ■ (j)m if TO > 0, 
empty string if n = 0, 
U)i ■ ■ ■ (j)n if n > 0, 
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(si, 2 ) bOOf (o')f ^ (0)i]2 for 1 < z < A: and for a digit a G {0, 1}, 

(si.s) [i{j)J {cr)Y (cr)f (cr)f]^ for 1 < i < fc and for a digit a G {0, 1}, 

(sm) [ 2 (J)^-(J + I)ib 

( 2 ) if 1 < j < | 7 ^|, AT G RGR(T^), and instruction X := A\s j-th element of V, 
then the rules given by the following schemes belong to TZ-p^-d- 

(5 2 .1) [ 2 (j)@(>)m+i ^ Wf ^m]2 for 0 < 771 < fc, where Vm is a string of 
objects on Op^p such that 

„ _ / (j + 1 )@ if m = 0 , 

(s 2,2) the scheme (51,2), 

(52,3) the scheme (51,4), 

( 3 ) if 1 < j < G RGR(T^), and instruction X := 0 is j-th element of V, 

then the rules given by the following schemes belong to TZp^p: 

(53.1) [ 2 (j)@(>)m-K ^ ( 0 )f (»)f for 0 < TO < fc where is a string 
of objects in Op^p as in (s2,i), 

(53.2) the scheme (51,2), 

(53.3) the scheme (51^4), 

( 4 ) if 1 < j < | 7 ^|, X G RGR(T^), and instruction X := A 0 X is j-th element 
of V, then the rules given by the following schemes belong to TZp^p: 

(54.1) [ 2 (j)@(>)m-K ^ for 0 < TO < fc, where zz„ is a string 

of objects in Op^p as in (si^), 

(54.2) bO')^ (>)™+i ^ (j + ]° for 1 < TO < fc, 

(54.3) b(i)^ Wf ^ (j + l)@(0)f (»)^]°> where 0 is a digit, 

(54^4) the scheme (54,2) > 

( 5 ) if 1 < j < I'PIjX G RGR(T^), and instruction X := |X| < 1 is j-th element 
of V, then the rules given by the following schemes belong to TZp^p: 

(s5,i) [2(j)@(>)m ^ tm]2 for 1 < TO < fc, where tm is a string of objects 
belonging to Op^p such that 



f 0 + if TO = 1 (here 0 is digit), 

tm= I 0')if (j)f (>)^ if TO = 2 , 

(55.2) b(i)^ Wm ^ (j + ]2 for 2 < TO < fc -b 1 , 

(55.3) bO')^ ^ (j + l)@(0)f where 0 is a digit, 

(55.4) the scheme (51,2)) 

( 6 ) if 1 < j < |'P |,1 < q < G RGR(T^), and both the occurrence 

of LOOP |X| as j-th element of V and the occurrence of END as g-th 
element of V are matching, then the rules given by the following schemes 
belong to TZp p: 

M b(j)i’(>)f -(9 + i)i(>)f]^, 

(56.2) bO')@(>)m-K ^ ( 9 )@(j)i^ • ■ • ObmObm-eiWm+l]^ for 1 < TO < fc, 

(56.3) b( 9 )@(j)b(j)*>i ^ (j + l)@(i)*®]2 for 1 < z < fc. 
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(56.4) [2(<Z)i(i)f-(9+l)i]^. 

(7) if 1 < j < € RGR(T^), and instruction X := t(-^) is j-th element of 

V, then the rules given by the following schemes belong to TZ-p p: 

( 57 . 1 ) [2(j)i(>) m+1 ^ '^m 2 for 0 < m < k, where Um is a string 

of objects as in (si^i), 

( 57 . 2 ) [ 2 (j)^ (j)f (>)m+i ^ (j + l)@(^)m ]2 for 1 < m < fc and for a 
digit a e {0, 1}, 

( 57 . 3 ) [2(j)^(>)f-(j + l)^(>)f]^ 

( 57 . 4 ) [2U)f+ii^)f+i (c^)f ]° for 1 < t < fc and a digit a G {0, 1}, 

(8) if 1 < j < G RGR(T^), and instruction X := 0 f is j-th element 

of V, then the rules given by the following schemes belong to TZp p: 

M [2(j)i(>)f-(j + i)i(>)f]^, 

(5 8 . 2 ) [2{j)2{^)ii^)m+i Wm(t>)m+i ]2 for 1 < m < fc and a digit a G 

{0,1}, where is a string of objects in Op^p such that 

=/(•? + l)@(l)f if is digit 1, 

if a is digit 0, 

(58.3) [ 2 (j)^ (J)f (0)f (>)^+i - (j + 1)^(>)^]^ for 1 < m < fc, 

(5 8 . 4 ) the scheme ( 574 ), 

(9) if 1 < j < I'Pl,-^ G RGR(T^), and instruction X := }(X) is j-th element of 
V, then the rules given by the following schemes belong to TZp p: 

( 59 . 1 ) [2(j)^(>)f-(j + l)i(>)f]^ 

( 59 . 2 ) [ 2 (j)@(cr);^(>);^+i ^ (j -h 1)@(>)„]2 for 1 < TO < fc and a digit 
aG {0,1}, 

(10) if 1 < j < |7^|, -Y, 1" G RGR(T^), and instruction X := XEzY is j-th element 
of V, then the rules given by the following schemes belong to TZp p: 

( 510 . 1 ) [2(j)^(>)r-(j+i)i(>)r]^, 

(sio. 2 ) [ 2 (j)@(CT)f(>);^+i ^ (j -h {o)^+i{t>)^+2]2 for 0 < TO < A: 

and for a digit a G {0, 1}, 

(11) if 1 < j < I'Pl,-^ G RGR('P), and instruction X := X 0 is j-th element 
of T’, then the rules given by the following scheme belong to TZp^p: 

(sii.i) [ 2 (j)@(>)™+! ^ (j + l)@(0)^+i(>);^+2]2, for 0 < TO < A:, 

(12) if 1 < j < |7^|, X, y G RGR(T^) and instruction X := X \ Y is j-th element 
of V, then the rules given by the following schemes belong to TZp^p: 

(si 2 .i) [ 2 (j)@(>)^+i(i>)^+i ^ s^{>)Xi+ii>)X+i ]2 for 0 < TO < A:,0 < n < 

k, where is a string of objects in Op^p such that 

^ f {j)X if TO = 0 orn = 0, 

"" 1 (j)o ■ ■ ■ (i)m if TO > 0 and n > 0, 

( 512 . 2 ) - ij + ^)U^)Xi>)^+ 2 {^)Xi>)X+l ]°2 for 
0 < m < k,0 < n < k, and for a digit cr G {0, 1}, 

( 512 . 3 ) [2(j)^ (>)^+i(>)r - (j + l)i(>)^+i(>)r]§ for 0 < TO < A:, 

(5 12 . 4 ) b(j)f (cr)f ^ {<^)f+i ]2 for 1 < 7 < A: and a digit a G {0, 1}, 
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( 13 ) if 1 < j < \V\,X,Y,Z G RGR(-P), and instruction Z := (S{X,Y,Z) is 
j-th element of V, then the rules given by the following schemes belong to 
TZ-pj)'. 

(si 3 ,l) [ 2 (j)@(>)m ^ (j + l)@(»)i]2 for 2 < m < /c + 1 , 

(513.2) [2(i)@(l)f (t>)fu ^ U + l)@(l)f (i>)fM]^ for a string u of objects 
in Op p in one of the following forms: 

(A) 

(A) 

ih) (o)f(>)f+i(i)^(>)^+i, 

(A) (i)f(>)f+A>)r, 
for 1 < i < A: and 1 < q <k 

(513.3) [2(j)@(>)fw ^ (j + l)@(l)f (>)fw]2 for u of the form (A) 

(513.4) [2(j)@(l)f (»)fw ^ (j + l)@(»)fw]2 for u in one of the following 
forms: 

ih) (o)f(>)f+i(o)^(>)^+i, 

(/e) ( 0 )f(>)f+i(>)r, 

for 1 < t < A: and 1 < g < A:, 

(513.5) [2(j)@(»)f'w ^ (j + l)@(>)f for u in one of the forms (A)-(/ 6 ), 

( 14 ) if 1 < j < \V\,X,Y,Z G RGR(-P), and instruction Z := Q{X,Y,Z) is 
j-th element of V, then the rules given by the following schemes belong to 
TZp^p: 

(si4,i) the same as (S134), 

(514.2) [2(j)@(>)fu ^ (j + l)@(l)f (>)fw]° for u of the form (A), 

(514.3) [2(j)@(l)f (t>)fu ^ (j + l)@(l)f (>)fM]2 for u in one of the forms 

(/i)) (/s)) (A)? (A)) 

(514.4) [2(j)@(>)fu ^ (j + l)@(>)f u]° for u in one of the forms (A), (A). 

(A)) (A)) 

(514.5) [2(j)@(l)f (t>)fu ^ (j -b l)@(i>)fu]2 for U in one of the forms (A), 

(A), 

( 15 ) if 1 < j < \r\,W,X,Y,Z G RGR(-P), and instruction W := H(X,Y,Z) is 
j-th element of P, then the rules given by the following schemes belong to 
TZp,p>'. 

(siS.l) [ 2 (j)@(>)m+l ^ Pm]2 for 0 < m < A: and for a string prn of objects 
in Op_p such that 

r, =[ if m = 0 , 

I 0 ')^ • • ■ (j)m+l if TO > 0 , 

(515.2) b(j)r (A1 +i] 2 for 1 < a < A: and cr G { 0 , l,o}, 

(515.3) [2(j)o^u ^ (a)^u{j + 1)^]^ for a string u of objects in Op^p in 
one of the following forms 

(^i)f(>)l^i(<r 2 )^(>)^+i(a 3 )f(>)f+i, 

(^i)f(>)m(>)r(>)f, 
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(for l<(j<fc, and for digits cti, ct 2 , CT 3 G 

{ 0 , 1 }) and for a digit a depending on u in such a way that ct is 1 
if digit 1 appears three times in u or 1 appears one time in u (in 
the places where CTi, (J 2 , Us occur), otherwise a is digit 0, 

(S 15 . 4 ) [ 2 ( 1 ) 0 ^ (>)f - (0)f(>)f (j + 1)^]^, 

(16) there is no any other rule in TZ-p p than those given by the above clauses 
(1)-(15). 
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1 Multisets in Unconventional Models of Computation 

All operational models of computation define two kinds of elementary concepts: 
Data and discrete change of data. Data are usually aggregated in data locations. 
Data are changed by actions. An action usually affects some, but not all data 
locations. 

In conventional models, data locations are composed of variables. Actions 
update variables, i.e. replace a variable’s present value by a new value. 

In unconventional models, data locations carry frequently sets of data. Oc- 
curence of an action coincidently updates some of those locations (i.e. adds or 
removes elementary data). Actions may concurrently access data locations: Two 
actions may add and/or remove data to or from a location, without interfering 
with each other. In particular, they may add two instances of the same data 
item, or an action may add an instance of a data item of which another item 
is already present in that location. Hence, ordinary sets are not adequate to 
describe concurrent access to data locations; instead, multisets (bags) are the 
adequate choice. In fact, many unconventional system models are based on mul- 
tisets, including Petri Nets [4] and the Chemical Abstract Machine [1]. Milner’s 
unifying approach to unconventional models [2] likewise makes multisets an es- 
sential concept. 

An algorithm with a proper initial state may evolve without multiple data 
items. But any attempt to generally prevent actions to generate multiple in- 
stances of data items in data locations requires actions be given comprehensive 
control over the location. Vice versa, interference free, concurrent access of ac- 
tions to a data location demands that data locations potentially may carry mul- 
tisets of data items. In fact, a number of unconventional models of computation 
suggest actions on multisets. 

In this paper we consider the most elementary model of (distributed) com- 
puting with multisets; i.e. high level Petri Nets. This model allows for useful 
analysis techniques, based on the linear theory of multisets. 
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2 The Linear Theory of Multisets 

The Universe U 

We employ the usual notation for sets, subsets, cartesian product, union, inter- 
section, element, etc. Z denotes the integers; N the non-negative integers. The 
truth values are true or false. 

For the sake of simplicity we assume a universe U, i.e. a collection of all 
relevant items. We construct cartesian products [/" and functions / : [/" ^ U, 
but avoid constructing set theoretical anomalies. 

Multisets 

Formally, a multiset is a mapping 

M :U 

For technical reasons we occasionally employ multisets with negative entries, 

M :U ^Z. 

M is finite if M (u) yf 0 for finitely many u G U only. A finite multiset M without 
negative entries may be written 

Af — [ui, . . . , ttn], 

for each u G U, there are M{u) entries Oi in [ai,...,a„] identical to u. The 
empty multiset, M{u) = 0 for all u G U, is written 



Each element u G U forms the multiset [u]. We often confuse u and [m], context 
allowing. By 

M 

we denote the collection of all multisets. 

Operations on Multisets 

Let M and N be multisets. M is smaller or equal to N 

M <N, 

if M{u) < N{u) for all u G U. The sum, M -|- is a multiset, defined by 

{M + N){u) := M{u)+N{u) 
for all u G U. As an analogy define M — N hy 

(M - N){u) := M{u) - N{u). 

Hence, M — N is a proper multiset (i.e. has no negative entries) iff C M. 
Finally, let 

-M := 0 - M, 

i.e. i-M){u) = ~{M{u)). 
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A^- Valued Functions 

We frequently consider functions / formed 

/ : C/” ^ M 

called A4 - valued. As a particular function let 0 denote the empty function, i.e. 
□ (■ui, . . . ,Un) = []• Each function f : U'^ ^ U can canonically be extended to 
f : ^ M., by i{ui, . . . ,Un) ■= [f{ui, . . . ,Un)]- We often write just / instead 

of f. 

Ad-valued functions can be summed up. For f,g:U^M,let f+g : U ^ M 
be defined as 



if + g){ui, - ■ ■ ,Un) ■■= f{ui,...,Un) + g{ui , . . . , Un) ■ 

Symbolic Representation of Elements and Functions 

As usual, we frequently employ terms to denote data and functions. A term 
is composed from constant symbols, variables and operation symbols. A term 
without variables is called a ground term and can be evaluated, denoting an 
element of the universe. A term with one variable (multiple occurrences allowing) 
denotes a function, /. To compute f{u), replace each occurrence of the variable 
by u. This results in a ground term, denoting an element. A term with n variables, 
together with an order on the variables, denotes an n-ary function /. To compute 
f{u\, . . . ,Un), replace each occurence of the i-th variable by Ui (t = 1, . . . , n). 

A multiset of terms with n variables denotes a multiset valued function. For 
example, the multiset [x ■ x,x + x] denotes a unary function. Applied to the 
integer 2, it returns the multiset [4,4]. 



3 Multiset Based Dynamic Systems 



States 



We start out with a set P = {p\, . . . ,Pm\ of data stores, called places. A state S 
assigns to each place p its actual contents, a multiset Sp. Hence, a state S can 
be written as a vector 

S = 

of multisets. 



Actions 

An action transforms a state S into a new state. S' . An action 



a = {a , a^) 
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has two components, a~ and a+. They describe the elements to be removed 
from places and to be augmented to places, respectively. Technically, actions are 
formed like states, i.e. as P-indexed vectors 

a~ = (a“p)pep and a+ = (a+p)pep 



of multisets. 

There is a quite intuitive graphical representation of actions a, with a circle 
for each place and a box for a itself. An arc from the circle for p to the box and 
is inscribed by a~p. Likewise, an arc from the box to the circle for p is inscribed 
by a+p. Arcs inscribed by the empty multiset are skipped, as well as circles that 
eventually remain isolated. 



[(V,R), (V,W)] 




^ [(R,V), (W,V)] 

Fig. 1. 



As part of the forthcoming case study. Fig. 1 shows an example. We assume 
places A, C and D, the constant V and two pairs of constants, (V, R) and (V, W), 
which also are elements of the universe. Then, let a ~ a = A = [V], a~c = 
[(F, R), (y, W)], a+D = [(i?, y), (TA, y)], and a+c = a ~ d = [] • 

Steps 

A state S enables an action a = (o“, 0 +) iff for each place p G P 

^ p — ‘^p^ 

i.e. iff for each place p G P, the multiset a~p to be removed from p, is required 
to be available at p. In this case, S and a define a step 

S ^S', 

with S' defined for each place p G P hy S'p = Sp — a~p + a+p. The graphical 
representation of an action a, as e.g. in Fig. 1, mimics its role in steps S — ^ S', 
i.e. its effect on S: Items flow along the arcs, as indicated in the arc inscriptions. 
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For example, assume three places A, C, D and a state S with Sa = \V,W], Sq = 
[{V,R),{y,W),{W,V)] and Sd = []■ Then action a, as depicted in Fig. 1, is 
enabled and yields the step S S' , with S' a = Sa , S' c = [(hF, F)] and 

s'D = mv),{w,v)]. 

Transitions 

A system usually does with quite a large number of actions. Actions are usually 
clustered into transitions. 

A transition is a parameterized action, with a set Q of occurence modes as pa- 
rameters: 

t={t~,t+), 

with t~ = (t~p)p^p and r*" = (t'^p)p^p. Each t~p and t+p is a function 
t~p '■ Q ^ M and t~^p : Q ^ M. 

A state S enables t in a mode g G Q iff for each place p G P, t~ p{q) < S. This 
leads to the step 

SA% S' , 

with S' defined for each p G P hy 

S'p = Sp- t~p{q) + F'‘p(g) . 

A transition t can be conceived as a family 

t — (^q)qeQ 

of actions tq = {tq~ ,tq~^), defined for each p G P hy 

tqp = t~pid) and tq^ = r p{q) . 

Fig. 2 depicts a transition. It employs the term representation as introduced at 
the end of Chapter 2, skiping multiset brackets. The action defined by occurence 
mode q = V has been given in Fig 1. 



Behaviors 

As usual, a finite or infinite sequence 

Q *1.91 o *2.92 Q 

Do * Ol * 02 * • • • 

of steps Si-i Si constitutes a behavior. Given a distinguished initial state 
init, a behavior is assumed to start with So = init 
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r(R) 

r(T) 

r(U) 

r(V) 

r(W) 



r(x) 



[(R,T), (R,V), (R,W)], 
[(T,R), (T,U), (T,W)], 
[(U,T)], 

[(V,R), (V,W)] 

[(W,R), (W,T), (W,V)] 

{(y.x) I (x,y) e r(x)} 



Fig. 2. 



Dynamic Systems 



We are now prepared to define the mathematical structure of multiset based 
dynamic systems. A system 



N = {P, T, init) 

consists of a set P of states, a set T of transitions as defined above, and an initial 
state, init. N characterizes a (finite or infinite) set of behaviors. 

Graphical Representation 

There is a quite intuitive graphical representation of dynamic systems: Each 
place p is represented us a circle (or ellipse), inscribed by init{p). Each transition 
t is represented as a box, with arrows to and from circles. The arrow from p to t 
is inscribed by t~p ; the arrow from t to p is inscribed by t~^p. Arrows inscribed 
by the empty multiset function 0 are skipped. 

4 A Case Study 

Consensus in a Network of Agents 

Let U he a, set of sites. Each site u is capable of sending messages to a set of 
other sites (the “neighbors” of u) . All sites together intend to reach some kind of 
contract or agreement. There is no mediator or broker; each site runs the same 
local algorithm. The graph in Fig. 3 depicts the set {R,T,U,V,W} of sites. 
Neighborship is indicated by arcs. For example, R and W are neighbors, R and 
U are not neighbored. Each site is always in one of two states, pending or agreed. 
Initially, all sites are pending. In order to negotiate with its neighbors, a pending 
site may coincidently send requests to all its neighbors, or may receive a request 
from some neighbor. For example, pending site V coincidently sends a message 
to both R and W; or V receives a message from W. A site is to circulate at most 
one bunch of requests at a time. Each receiver of a request is obliged to answer it. 
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u 



Fig. 3. A network of sites 



In a situation where all requests sent by a site u have been answered, u may 
turn from pending to agreed, u remains agreed unless u obtains a message. Upon 
such a message, u turns pending again. 

Call a state stable if all sites are agreed. We suggest an algorithm that does 
not guarantee a stable state will ever be reached. But it guarantees a stable state 
will retain: In a stable state, no site will turn pending again. 

Fig. 4 shows the behavior of site V: The site is initially pending and its two 
requests (V, R) and (U, W) to its neighbors R and W are completed. Intuitively, 
(V, R) and (V, W) are “envelopes” that V may send to R and to W, respectively. 
Pending V has the choice of three actions, depending on its environment: With 
both requests completed, V may move to agreed by help of action d. Alterna- 
tively, V may initiate requests R and W, by action a, yielding {R, V) and (LF, V) 
at initiated requests. And R may receive a message (U, y) from a neighbor y (with 
y = R or y = W) and return the envelope (y, V) to this neighbor. 

What has been done in Fig. 4 for the site V, can likewise be done for all oth- 
ersites, too. The would be structurally identical; only inscriptions would change. 
Using a variable, x, for sites. Fig. 5 then squeezes all Figures in one: For a 
pending site x, transition a initiates the requests (y,x) to all neighbors y of x, 
provided all requests from x are completed. 



d 



pending 




agreed 



Fig. 4. 
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d 



pending 

sites 




agreed 

sites 



sort 


site 


tot r, r :site -> set of messages 


sort 


message = site x site 


var X, y : site 


const 


U : set of sites 


r(x) = {x} X M(x) 


const 


M : set of messages 


fix) = M{x) x{x} 



Fig. 5. Consensus Algorithm 



A pending site x receives a request (x, y) along transition a, answers it by a 
message {y, x) and remains pending. An agreed site x receives a request (x, y) 
along transition c, answers it by a message (t/,x), and turns pending. Finally, 
a pending site x may turn agreed along transition d, provided all requests r(x) 
of X are completed. Notice the different roles of places in Fig. 5: To conceive 
the behavior of a single site u, replace x by u. Hence, conceive five copies of 
Fig 5, each replacing x by one of the sites R,T,U,V,W. Each site u has its 
local states pending('u) and agreed(u), and its local data store for completed 
requests. The initiated requests place links the five copies: It contains messages 
{u, v) sent by w, but not get received by u. The initial state enables transition 
a in each mode x = i?, . . . , x = IF, because pending site contains i?, . . . , IF and 
completed requests contains r{R) U . . . U r(IF). Likewise, transition d is enabled 
in all modes. Occurrence of d in all modes x = R,...,x = W leads to the 

stable state. Alternatively, init S' is a step with S (completed requests) = 
r(S) U . . . U r(IF) and S (initiated requests) = F{A). 

The Consensus Algorithm 

With U = {R,T,U,V,W} and M as described above. Fig. 5 describes the con- 
sensus algorithm for the network of Fig. 3. For a different set U of agents and 
a different set M of neighbors. Fig. 5 likewise represents the consensus algo- 
rithm. Generally formulated. Fig. 5 represents the algorithm for any network: 
The specification of U, M, r and f allows for infinitely many interpretations. 
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5 Verification 



The linear structure of steps allows for powerful analysis techniques; they even 
apply to the schematic level of system models, as discussed above. 



Linear Functions 

A function / : At — > At is linear iff for all multisets M, N holds 

f{M + N) = f{M) + f{N), and 
/(-M) = -/(M) . 

To each unary Af-valued function f : U ^ M. there exists a unique linear 
extension i : M ^ M with f([t(]) = f{u). 



Composition of Functions 

A function f : M ^ M can be composed with a Ad-valued function g : ^ M, 

yielding fog: ^ M, defined as usual for mi, . . . , G U 

/ o g{ui, ...,Un)= f{g{ui, Un)). 

For the special case of n = 0, g{) is a multiset, written g. Then 

f*9 = fig) 

which is application of / to a multiset. 



Composition as Vector Product 

For a vector i = {ip)p^p of functions ip : A4 ^ A4 and a vector t = (tp)p^p 
of functions tp : C/" — > Ad for some fixed n, we employ composition of their 
components as a product and define the usual vector product of i and t as 

i ■ t := iot. 

pGP 



Obviously, z • t is a function z • t ^ Ad. In case of 0-ary components tp, i.e. 
multisets, we write 

i{t) instead of iot 



Obviously, i{t) G Ad. 
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Place Invariants 

A vector {ip)p^p of linear multiset functions ip : A4 ^ A4 is a place invariant 
of a dynamic system ^ iff for each transition t of 

iut = 0. 

A place invariant establishes a system invariant as follows: 

Theorem 1. Let ^ = (P, T, init) be a dynamic system and let i he a place 
invariant of^. For each reachable state S of holds: 

i{S) = z(init) . 



Traps 

A vector i = {ip)p^p of linear multiset function ip : A4 —> A4 is a trap of Y 
for all transitions t of 

iu t > 0 . 



Theorem 2. Let N = {P, T, init) he a dynamic system and let i he a trap of N 
with i • init > 0 For each reachable state S of N holds: 

i{s) > D . 

Verification of the Consensus Algorithm 

We claimed that stable states of the consensus algorithm retain. Hence we have 
to show that at each reachable state of the algorithm holds: if all sites are agreed, 
then no request is initiated. Shortly, for all reachable S, 

S ^ {agreed sites = U initiated requests = []). (5-1) 

Given a concrete set U and a concrete function r (and hence f and M), 
proposition (5.1) can be examined by visiting all reachable states S. 

But we are not given concrete U and r. Instead, Fig. 5 is a schema for any 
choice of U and r. And (5.1) is claimed to hold for any such choice. As a matter 
of convenience. Fig. 6 replaces the long place inscriptions of Fig. 5 by A, B, C, 
D. What we have to show then reduces to 

B = U^D=[]. (5.2) 

This can now easily be done by help of two place invariants, 

A + B = U (5.3) 

and 

C + D = M, (5.4) 

where D = {(y,a;)| {x,y) G P}, as well as the trap 

r{A)+C>r{U). 



(5.5) 
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d 




sort 


site 


fct r, r :site -> set of messages 


sort 


message = site x site 


var X, y : site 


const 


U : set of sites 


r(x) = {X} X M(x) 


const 


M : set of messages 


f(x) = M{x) X {x} 



Fig. 6. 



Obviously, r{U) = M. (5.6) 

Then (5.3) and (5.6) imply 

r{A) + r{B) = M. (5.7) 

Subtracting (5.5) from the sum of (5.4) and (5.7) 

yields (5.4) + (5.7) - (5.5) : 
r{B) + D < M. 

Now we conclude from (5.6) 

B = U ^ r{B) = r{U) = M 
and r{B) = M ^ D = [] , by (5.8). 
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Abstract. Artificial chemical system are well studied in Artificial Life 
and Complexity. Here we overview results about such a class of artificial 
chemical systems: Abstract Rewriting Systems on Multisets (ARMS), 
which are particular types of P Systems and exhibit complex behaviors 
such as non-linear oscillations. In short, an ARMS consists of a membrane 
which contains “chemical compounds” (denoted by symbols); these com- 
pounds evolve through chemical reactions in a cell-like device. To such 
systems we apply a genetic method and, by a simulation on computer, 
we find several results of Artificial Life interest. 



1 Introduction 

Artificial Life is the study of man-made systems that exhibit behaviors 
characteristic of natural living systems. It complements the traditional 
biological sciences concerned with the analysis of living organisms by 
attempting to synthesize life-like behaviors within computers and other 
artificial media [36]. 

C.Langton, Artificial Life. 

It has been over a decade since Langton proposed “Artificial Life (ALife)” 
with the above definition. He also explained “by extending the empirical founda- 
tion upon which biology is based beyond the carbon-chain life that has evolved 
on Earth, Artificial Life can contribute to theoretical biology by locating life-as- 
we-know-it within the larger picture of life-as-it-could-be” . 

With the advent in biotechnology and increasing interests in genome infor- 
mation worldwide, genome databases progress so fast that the whole sequence 
database of human genome will be published soon. Hence now we are confronted 
with the problem of how to deal with the overwhelming genome information. As 
traditional biological analysis is only powerful in analyzing a single gene most of 
the case, ALife study turns out to be more and more important in carbon-chain 
life science. 



C.S. Calude et al. (Eds.): Multiset Processing, LNCS 2235, pp. 299—346, 2001. 
(c) Springer- Verlag Berlin Heidelberg 2001 
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One challenging and important theme in genome science is to discover gene- 
networks. Our knowledge of the genes’ regulation has come from an accumulation 
of fundamental studies over the past. However recently with a new technology, 
the DNA chip provides us with genome wide information fast and informative. 
Combination with genomic database and use of ALife, now we are closer to find 
the correlation and regulation between genes. In order to understand the gene 
networks, we usually need to model a system by first synthesizing the empirical 
data, and in this work an Alife approach is necessary. One of the ALife models, 
Kauffuman’s boolean network model, has been rather instructive in the analysis 
of gene networks [52] . 

Therefore, the subjects of ALife studies are changing. It is much easier these 
days to obtain numerous data from living systems, though we still need methods 
for observing or analyzing them systematically. Thus, it is necessary to construct 
theoretical/symbolic systems. This resemble the studies in cosmology. Although 
we cannot observe Big-bang, we can infer many features from empirical data 
through theoretical models. 

Empirically, abstract chemical systems turn to be powerful in describing com- 
plex systems. In this paper first introduce the abstract chemical model we use, 
based on multiset rewriting, and then we present some experimental results. 
We will also briefly discuss an application in ecology. The used systems can be 
considered as P systems with only one membrane. After that, we enhance the 
adequacy of the system to ALife matters by introducing a membrane structure, 
similar to those used in P systems. 

Abstract Chemical System. Life can be considered as a system in a specific class 
of chemical reaction systems, but real biochemical systems are so complex that 
it is difficult to reconstruct the precise dynamics of such a system. Thus, it is 
important to abstract the essential properties of biochemical systems in order to 
obtain insights into their dynamical properties. 

There have been developed various Artificial Chemistries. In a broad sence, 
an artificial chemistry is a man-made system which is similar to a chemical 
system. It can be defined as a set of objects and a set of reaction rules which 
specify how the objects interact. It covers large held, such as: 

Modeling: living systems (e.g., origin of life), sociology and parallel pro- 
cesses. 

Information processing: control, automatic proving, chemical computing. 

Optimization: Combinatorial problems (e.g., TSP). 

2 ARMS 

Extending the concepts of abstract rewriting systems, we introduce the notion 
of an abstract rewriting system on multisets (ARMS). Intuitively, an ARMS is 
like a chemical solution in which floating molecules can interact with each other 
according to given reaction rules. Technically, a chemical solution is a finite 
multiset of elements denoted by symbols from a given alphabet, A = {a,b, . . . 




Abstract Rewriting Systems on Multisets 301 



these elements correspond to molecules. Reaction rules that act on the molecules 
are specified in ARMS by rewriting rules. As to the intuitive meaning of ARMS, 
we refer to the study of chemical abstract machines [5] . 

Let A be an alphabet (a finite set of abstract symbols). The set of all strings 
over A is denoted by A*; the empty string is denoted by A. (Thus, A* is the free 
monoid generated by A under the operation of concatenation, with identity A.) 
The length of a string w G A* is denoted by |w|. 

A rewriting rule over A is a pair of strings {u, v), u,v G A* . We write such a 
rule in the form u ^ v. Note that u and v can also be empty. A rewriting system 
is a pair {A, R), where A is an alphabet and i? is a finite set of rewriting rules 
over A. 

With respect to a rewriting system 7 = (A, R) we define over A* a relation 
as follows: x y iS x = X\UX2 and y = X1VX2, for some xi,X2 G A* and 
u V G R. The reflexive and transitive closure of this relation is denoted by 

^From now on, we work with an alphabet A whose elements are called objects; 
the alphabet itself is called a set of objects. 

A multiset over a set of objects A is a mapping M : A — > N, where N is 
the set of natural numbers, 0, 1, 2 ,.... The number M{a), for a G A, is the 
multiplicity of object a in the multiset M. Note that we do not accept here an 
infinite multiplicity. The set {a G A | M{a) > 0} is denoted by supp{M) and is 
called the support of M. The number X^aeA -^(®) denoted by weight{M) and 
is called the weight of M. 

We denote by A"^ the set of all multisets over A, including the empty multiset, 
0, defined by 0 (a) = 0 for all a G A. 

A multiset M : A — > N, for A = {oi, . . . , a„}, can be naturally represented 
by the string and by any other permutation of this string. 

Conversely, with any string w over A we can associate a multiset: denote by 
the number of occurrences of object in ru, 1 < f < n; then, the multiset 
associated with w, denoted by M„,, is defined by Myj(ai) = |ru|oi, 1 <i <n. 

The union of two multisets Mi, M2 : A — > N is the multiset (Mi U M2) : 
A — > N defined by (Mi U M2)(a) = Mi (a) + M2(a), for all a G A. If Mi(a) < 
M2 (a) for all a G A, then we say that multiset Mi is included in multiset M2 and 
we write Mi C M2. In such a case, we define the multiset difference Mi — M2 
by (M2 — Ml) (a) = M2 (a) — Mi (a), for all a G A. (Note that when Mi is not 
included in M2, the difference is not defined). 

A rewriting rule 

a ^ a . . .b, 

is called a heating rule and denoted as rA>o'j it is intended to contribute to the 
stirring solution. It breaks up a complex molecule into smaller ones: ions. On 
the other hand, a rule such as 

a . . . c ^ b, 

is called a cooling rule and denoted as rA<0i it rebuilds molecules from smaller 
ones. In this paper, reversible reactions, i.e., S ^ T, are not considered. We 
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shall not formally introduce the refinement of ions and molecules though we use 
refinement informally to help intuition (for both types of rules we refer to [5]). 

A multiset rewriting rule over a set A of objects is a pair (Mi, M 2 ), of elements 
in (which can be represented as a rewriting rule w\ ^ W 2 , for two strings 
wi,W 2 G A* such that = Mi and = M 2 ). We use to represent such a 
rule in the form Mi ^ M 2 . 

An abstract rewriting system on multisets (in short, an ARMS) is a pair 

r = {A,R), 

where: 

(1) A is a set of objects; 

(2) i? is a finite set of multiset evolution rules over A. 

With respect to an ARMS F, we can define over A"^ a relation: (=k): for 
M, M' G A^ we write M =k M' iff 

M' = (M - (Ml U . . . U Mfc)) U (M( U . . . U M^, ) 

for some Mi M[ G i?, 1 < i < fc, A: > 1, and there is no rule Mg Mg G R 
such that Mg C (M— (MiU. . .UM^,)); at most one of the multisets Mj, 1 < i < k, 
may be empty. 

With respect to an ARMS F = (A, R) we can define various types of multi- 
sets: 

- A multiset M G A# is halted if there is no M' G A# such that M =k M' 
(this is equivalent to the fact that there is no rule Mi ^ M 2 G R such that 
Ml C M). 

- A multiset M G A# is in initial state if there is no M' G A"^ such that 
M' M. 

With respect to an ARMS F, we can define over three transition relations: 

(1) sequential transition (=kg): for M, M' G A# we write M =kg M' iff 

M' = (M-Mi) UM(, 

for some Mi ^ M( G R. 

(2) free transition (=k/): for M, M' G A^b we write M =k/ M' iff 

M' = {M - (Ml U . . . U Mfc)) U (M( U . . . U M(,, ) 

for some Mi M[ G i?, 1 < z < fc, fc > 1; at most one of the multisets 

Mi, 1 < z < Ac, may be empty. 

(3) maximally parallel transition (=kp): for M, M' G A^ we write M =kp M' 
iff 

M' = {M - (Ml U . . . U Mfc)) U (M{ U . . . U M^, ) 

for some Mi M' G i?, 1 < z < fc, fc > 1, and there is no rule Mg Mg G R 

such that Mg C (M — (Mi U . . . U Mfc)); at most one of the multisets Mi, 1 < 
i < k, may be empty. 

(Of course, in all cases, the union and the difference operations are multiset 
operations.) 
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2.1 How ARMS Works 

Example. Let us consider the ARMS defined as follows: 

r = {A,R), 

A = {a,b, c,d,e,f}, 

R = {a, a,a ^ c : ri,b ^ d : r2,c ^ e : rs, 

d^ f,f :r4,a^ a, b,b, a: T5,f ^ h: tq}. 



We assume the maximal size of a multiset is 4 and the initial state is given 
by {a, a, 6 , a}. In an ARMS, the rewriting rules are applied in parallel. When 
there are more than two applicable rules, then one rule is selected randomly. 
Figure 1 illustrates a a sequence rewriting steps, starting from the mentioned 
initial state. 



{a, h, a, a} C a, a, a, a, b 

I (the left hand side of ri, C 2 , rs) 

{c, d} ^ c, d (the left hand side of rs,r 4 ) 

f 

{fi) /i /} ^ / (the left hand side of rg) 

I 

{e,h,h} There are no rule to apply, it reaches 
the halt state 

Fig. 1. Example of rewriting steps of ARMS 



At the first step, the left hand side of rule of ri, T2 and are included in 
the initial state. In the next step, and r4 are applied in parallel and {c, d} 
is rewritten into {e, /, /}. In step 3 , by using tq, {e, /, /} is transformed into 
{e, h, h}. There are no rules that can transform the multiset any further so, the 
multiset is in a halt state. 

3 Typical Behaviors of ARMS 

In this section we discuss the following two types of behaviors of ARMS: (i) 
ARMS in terms of sequential dynamical system and (ii) the simulation of Brus- 
selator model which is a model of Belousov-Zyabotinsky reaction. 

3.1 ARMS in Terms of Sequential Dynamical System 

Sequential Dynamical Systems (SDs) are a new class of dynamical systems mo- 
tivated by the formalization of simulation by composed local maps. Intuitively, 
SDs are dynamical systems produced by sequentially ordered compositions of 
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Table 1. Rule Ordering and Emergence of Cycles 



Emergence of Cycles 


Terminated 


9.4%(68/720) 


90.6%(652/720) 



Table 2. Steps needed for Emergence of Cycles 



Rewriting Steps 15 16 17 18 19 20 21 22 
Number of Cycles 5 8 8 16 5 9 10 2 



local maps. The dynamical properties of SDs delimit the behavioral repertoire 
of simulation. The concept of SDs can be found in some studies in Artificial 
Chemistries, [32] [17]; Barrett at el. have mathematically formalized this notion 
[2], [3]. They employ sequential cellular automata (sCA) over random graphs 
as a paradigmatic framework for elements of a theory of simulation and show 
that dynamical systems generated by simulations can be classified by equiva- 
lence classes on update sequences. On the other hand, we employ ARMS with 
sequential transition and show that update sequence affects the state transition. 



ARMS with Sequential Transitions 

Rule Order and Emergence of Cycles. We examine first the relationship between 
the ordering of rules and the emergence of cycles. 

In the case of the ARMS from the previous example, the total number of 
possible orderings of rules is equal to 6! = 720. For each order, we assume that 
a will be introduced in the system after each rewriting step and we will check 
whether cycles will emerge or not. 

Table 1 shows the results of this experiment. Surprisingly, cycles emerge in 
only 9.4 per cents (68/720) of 720 ordering, and in the remaining 90.6 per cents 
the rewriting calculus is halting. In 68 orders where cycles emerge, we need from 
15 to 22 steps for generating a cycle and the average and the standard deviation 
is 19 and 3.3 steps (table 2), respectively. The average of the period length of 
cycles is 5.5 steps, whose standard deviation is 0.49 steps. 

As to halting cases, the average of steps needed for termination is 39 steps, 
and the standard deviation is 6.48 steps. These results are summarized in Table 3. 

Typical Examples. In this paragraph, we present two examples^ which illustrates 
the importance of rule order. The first example is a typical case which generates 
a cycle, and it uses the following rule order: 

Ori = [ri ^ r2 ^ rz ^ ri ^ r^ ^ re}, 

^ It is notable that these two orderings provide illustrative examples on the formal 
property of non-termination of ARMS, which is discussed in Section 5. 
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Table 3. Summary of Statistical Characteristics 

Steps Needed for Emerging Cycles 18.0 ± 3.3 Steps 
Length of Cycles 5.5 ± 0.49 Steps 

Steps Needed for Terminating 39.0 ± 6.48 Steps 

whose state transition is shown below. After 18 steps, the system is stable in 
forming a cycle, with the period length of 5 steps. 

!•{} 

2. {a} 

J, 16 steps 

18. {o, a, a, a, a, b, c, e} 

19. {a, a, a, a, a, c, c, e} 

20. {o, a, a, a, a, a, c, d, d, e} 

21. {a, a, a, a, a, a, a, c, d} 

22. {a, a, a, a, a, o, a, o, c, e} 

23. {a, a, a, a, a, b, c, e} 

24. {a, a, a, a, a, c, c, e} 

25. {a, a, a, a, a, a, c, d, d, e} 

26. {a, a, a, a, a, a, a, c, d} 

27. {o, a, a, a, a, a, a, a, c, e} 

The next example is a typical case, which terminates in 42 steps. Although 
the ARMS applies the same rules, ri to for the state transition, the obtained 
result is completely different, as shown below. This example has the following 
rule order: 

Or2 = {rs ^ ri ^ T2 ^ T4 ^ T5 ^ Tq} 

Then, the state transition is as follows: 

!•{} 

2. {a} 

I 16 steps 

18. {a, a, a, a, a, a, a, a, c, c} 

19. {a, a, a, a, a, b, c, c} 

20. {a, a, a, a, a, c, c, c} 

21. {a, a, a, a, a, a, c, c, c} 

22. {a, a, a, a, a, a, a, a, c, c} 

23. {a, a, a, a, b, c, c, c} 

24. {a, a, a, a, c, c, c, c} 

25. {a, a, a, a, a, c, c, c, c} 

26. {a, a, a, a, a, a, c, c, c, c} 

i 

42. {a, a, c, c, c, c, c, c, c, c} 



16 steps 
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In the next section, we further examine the effect of inputs and rule order. 
For this purpose, we adopt the above two kinds of order, Ojn and Oji 2 , because 
of the following two reasons: first, the behavior of Om is a typical cycle structure 
in 68 cases. Second, the behavior of Or 2 is completely different from the above 
one, although the ARMS applies the same rules in both cases. 



The Effect of Inputs and Rule Order. In the above section, we have assume 
that a is added after each rewriting step, with the ordering of rules fixed, and 
we have examined the effect of rule ordering. In this section, we only adopt the 
above two orderings: Ofa and Or 2 and we examine the effect of random inputs. 
After that, we introduce randomness in rule ordering and examine its effect on 
the system behavior. 

Characteristic Function. In order to incorporate randomness, we introduce char- 
acteristic function as follows. 



Characteristic Function. Let a; be a uniform random number selected in the 
interval [0, 1). Then a characteristic function hs{x) is defined as: 



hs{x) 



1, X < 6, 
0, X > S, 



where S denotes a threshold from the following set, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 
0.7, 0.8, 0.9 . 

For example, when S is equal to 0.9, then the frequency, with hs{x) positive, 
is equal to 0.9. Thus, when this function is applied to inputs, a will be input for 
nine of ten rewriting steps, which also means that no input will be done in one 
of ten steps. 

On the other hand, when the characteristic function is applied to rule order, 
the rule order will not be perturbed in nine of ten rewriting steps, which means 
that the rule order is perturbed in one of ten steps. For example, let us consider 
the case of Om. When the sixth step in ten rewriting steps is perturbed, the 
rules are applied as shown in the following sequence: ri, r 2 , rs, r 4 , rs, r 2 (re), 
xi, r 2 , T 3 , T 4 , where rg is changed to T 2 . Thus, both input randomness and rule 
randomness are equal to 1 — hs{x). 



Experimental Results. Using the above two kinds of ordering, we perform the 
following three kinds of experiments: 

(1) Decrease the frequency of inputs, with the rules order fixed. 

(2) Introduce randomness in the rules order, with inputs in every step. 

(3) Decrease the frequency of inputs, and introduce randomness in the rules 
order. 

In each case, we have performed 100 trials, each of which consists of 5005 
rewriting steps, and we have measured the effect on emergence of cycles, extinc- 
tion of cycles, and steps needed for termination. 
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Table 4. Relation between Frequency of Inputs and Termination of Calculus 



Frequency of Input 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 



Ori 


0 0 0 0 0 0 


0 0 0 (%) 


Or2 


100 100 100 100 98 13 


0 0 0 (%) 


Rule Order: Fixed 




Length of Cycles 

Fig. 2. An Illustrative Example in Ori 



Decrease Frequency of Input, with Rule Order Fixed. In the above three kinds of 
experiments, the most interesting one is the case when the frequency of inputs 
is decreased, with the rules order fixed. In the cases of the rules order Om, 
the rewriting steps are not terminated in all the trials, and cyclic structures 
also emerge. Interestingly, the period of cycle varies from 7 to 30 steps, and a 
period-doubling of cycles is observed. Figure 2 shows an illustrative example in 
the experiments, where the frequency of inputs is 0.2. The vertical axis indicates 
the number of cycles which emerge in 100 trials, and the horizontal axis shows 
the period length of cycles. In this figure, the peaks of the period length of cycles 
are located around 7, 13, 20, and 28 steps, which shows a period-doubling and 
for each period-doubling, a fusion of cycles is observed. On the other hand, in 
the case of the rules order Or 2 , unstable cycles emerge and rewriting steps are 
terminated when the threshold is larger than 0.5. However, when the threshold 
is equal to 0.4, almost all rewriting trials are not terminated, and cycles become 
stable. Moreover, in this case, the variability of the period length of cycles is the 
largest among all trials. Figure 3 shows the distribution of the period length of 
cycles; the notation is the same as in Figure 3. The peaks of the period length 
of cycles is located around 11, 23, 34, 46, 57, and 67 steps, which indicates a 
period-doubling. If the threshold is less than 0.3, all the trials are stable, similar 
to the case of the rules order Ori. These results are summarized in Table 4. 
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Table 5. Relation between Randomness of Rule Order (1 — 5) and Nnmber of Emerging 
Cycles 

1-5 0.1 0.3 0.5 0.7 0.9 

Om 1300(5) 325(5) 106(5) 106(5) 23(5) 

Or2 1269(11) 1118(5) 335(3) 223(3) 300(3) 

A(B) : A and B denote the nnmber of emerging cycles, 
and the median of the length of cycles, respectively. 



200 
150 

No. 100 
50 
0 

0 20 40 60 80 100 

Length of Cycles 

Fig. 3. An Illustrative Example in Or 2 '- Frequency of Input: 0.4, with Rule Order 
Fixed. 




Randomness in Rule Order, with Inputs in Every Step. In these experiments, for 
the rules order Om, cycles become unstable and a period-doubling is observed 
when the randomness of the rules order is low. Figure 4 shows the distribution 
of the period length of cycles when randomness of rule order is equal to 0.2; the 
notation is the same as in Figure 2. 

However, as randomness of the rules order grows, the number of emerging 
cycles decreases exponentially. Table 1 shows this tendency, the relation between 
randomness of rule order and the number of cycles emerging in 100 trials. When 
the randomness is larger than 0.5, almost the all trials generate only one cycle. 

On the other hand, the behavior of cycles in Om is different. As shown in ta- 
ble 1, the number of emerging cycles decreases exponentially, as the randomness 
of the rules order grows. However, the behavior of cycles when randomness of 
rule is higher than 0.5 is different from that when the randomness is lower than 
0.3: the median of the period length of cycles in the latter cases is half smaller 
than that in Or\ , while that in the former case is equal to or twice than that in 
Ora- Interestingly, trials in the case of Ora have two phase transitions: one is 
between 0.1 and 0.2, and the other is between 0.3 and 0.5. 
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Length of Cycles 



Fig. 4. An Illustrative Example in Ori \ Input: Fixed, Randomness of Rule Order: 0.2. 

Figure 5 shows the distribution of the period length of cycles in Or 2 when 
randomness of the rules order is equal to 0.1. It is notable that the cycles whose 
period length is 11.0 are dominant over those whose period length is 5.0. 

Decrease Frequency of Input and Increase Randomness in the Rules Order. In 
the final cases when the frequency of inputs is decreased and randomness is 
introduced in the rules order, the obtained results are straightforward. In the 
cases when randomness is introduced in the rules order with an input in every 
step, cycles become unstable in Or\, and only unstable cycles emerge in Or 2 - 
However, in both cases, the number of emerging cycles decreases exponentially, 
as randomness of the rules order grows. Interestingly, the behavior of cycles in 
Ori is very similar to that in Or 2 , except when the frequency of inputs is very 
high and randomness of the rules order are very low. 

Fusion of Cycles. A special case in the rules order Or 2 , with the frequency of 
inputs 0.4, generates a complex behavior of cycles. A closer examination shows 
that a core cycle is generated first and that other cycles around this cycle fuse 
into a core cycle. 

The generated core cycle Cg is given as follows: 

cg : 8 ^ 9 ^ 10 ^ ^ 18 ^ 8 : {llsteps), 

where each number denotes the number of rewriting steps. Around the Cc, the 
following cycle C54 is also observed: 

C54 : 54 ^ 55 ^ 56 ^ > 66 ^ 54 : (ISsteps), 

It is impossible to transit from State 8 to State 54 and State 66 to State 11, with 
the input fixed, but it will become possible when the randomness of the input is 





310 



Yasuhiro Suzuki et al. 




Fig. 5. An Illustrative Example in Or 2 '- Input: Fixed, Randomness of Rule Order: 0.1. 



introduced. Then, cg fuses into cn, and a new cycle will be generated as follows: 

Cc : 8 ^ 54 ^ > 66 ^ 11 ^ > 8 : (21steps) 

Thus, the fusion of cycles depends on the randomness of the input: the ran- 
domness of the input influences the application of rules, and then the change 
of application of rules causes the change of the structure of a multiset, then it 
opens the hidden pathway to fusion of cycles. 

Although the mechanism of fusion of cycles is not fully studied, it is notable 
that even such a simple structure produces a complex behavior of cycles. A 
further research on fusion, such as the classification of cycles, will be the object 
of a future work. 

3.2 Brruselator Model on ARMS 

We have performed an experiment about the Brusselator [43] model by using an 
ARMS with a free transition. The Brusselator is a model of a chemical oscillation 
related to the Belousov-Zabotinsky reaction (figure 6). 

We interpret these abstract chemical reaction equations as the rewriting rules 
shown in figure 7. 



A 

B + X'^ Y + D 
2X+ 3A 
A He 



Fig. 6. Abstract chemical model of Brusselator. 
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X : n 

B YD :r2 

XX XX X-.T3 

X^ E : r4 

Fig. 7. Rewriting rules for Brusselator. 



The ARMS for simulating Brusselator is defined as follows; 
r = {A,R), 

A = {A,B,C,D,X,Y}, 

R = Rewriting rules for Brusselator. 

In this simulation, the reaction rate corresponds to the frequency of rule 
application. If ri has the highest reaction rate, then ri is applied at the highest 
frequency. 

Simulation of the Brusselator Model. Let us examine the relationship between 
the frequency of rule application (reaction rate) and the concentration of X and 
Y in the multiset. The concentration of X and Y in the multiset is indicated by 
the number of X and Y present in the multiset. 

As to the initial condition, we assume that the maximal multiset size is equal 
to 5000 and the initial state of the multiset is the empty multiset. We assume 
that the system gets inputs A and B continuously. Thus, this model can be 
regarded as a continuously- fed stirred tank reactor (CSTR). 

In this simulation, we confirmed that oscillations between the number of 
X and Y in the multiset emerged. Furthermore, we discovered three types of 
oscillations, as follows: 

(1) divergence and convergence (figure 11, figure 8), 

(2) quasi-stable oscillations (figure 9) and 

(3) unstable oscillations (figure 10). 

The oscillation in the number of X and Y emerges as follows: first, the number 
of X increases; when the number of X decreases, the number of Y increases. We 
say that this type of oscillation is a A- T oscillation, while oscillations in the 
number of X or Y are called A-oscillations and Y-oscillations, respectively. 

Divergence. When the frequency of the application of rule T 2 is much larger than 
that of ri, for example 0.1 for ri and 0.45 for V 2 , the system becomes unstable 
and in many cases it diverges. 

In this case, a system generates X-Y oscillations (figure 11). The system 
diverges easily by perturbation and it is difficult to predict whether we get 
oscillations or divergence. 

Convergence. When the frequency of the application of r\ is much larger than 
that of C 2 , for instance, 0.1 versus 0.04, the system converges (figure 8). In this 
case almost all the oscillations are X or Y-oscillations, and only a few X-Y 
oscillations can be found. 
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(Quasi) Stable Oscillations. When the frequency of the applications of ri is 
slightly larger than that of r 2 , for example, 0.1 versus 0.15, the system is able 
to generate X-Y oscillations. Even if we introduce perturbation to the system, 
it maintains this type of oscillation (figure 9). 

In this case, X or Y-oscillations decrease and X-Y oscillations increase, and 
vice versa. X- Y oscillations can be classified into the following three categories; 

(1) The highest peak of the number of Ys or Is is between 35 and 50. 

(2) The highest peak of the number of Ys or Is is between 15 and 35. 

(3) The highest peak of the number of Ys or Is is between 1 and 15. 

Each type (1), (2), (3) of oscillation appears irregularly. In other words, in this 

case the system generates a limit cycle, or an “attractor” . 

Unstable Oscillation. When the frequency of T 2 is slightly larger than that of r\, 
for instance, 0.1 versus 0.35, the system exhibits an unstable oscillation. In this 
case, there are a few Y or Y-oscillations and almost all the oscillations become 
Y- Y oscillations. A great variety of Y- Y oscillation can be found (figure 10). 
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Fig. 10. Example of Unstable oscillation 
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4 Theoretical Analysis of ARMS 

In this section we address some theoretical aspects of ARMS. First, the order 
parameter that describes the degree of emergence of cycles is introduced. This 
parameter is related to the Langton’s A parameter and Edge of Chaos. After 
that, we consider rewriting systems from the viewpoint of stochastic transitions 
of states. 

4.1 Order Parameter of an ARMS 

From an examination of the effectiveness of the termination property, we ob- 
tained the Ae parameter as an order parameter for the qualitative behavior of 
ARAIS. 

We define “order” here as being given by the diversity of cycles. Thus, in 
this paper, “ordered state” refers to a case where the system yields simple cycles 
(such as the limit cycle), while “disordered state” refers to the case where the 
system yields chaotic or complex cycles. 
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The Xe Parameter. Let us define the Ag parameter as follows: 

, _ XJrAs>o 

1 + {SrAs<o — 1 ) 



( 1 ) 



where XJrAS>o corresponds to the number of heating rules used, and XJrAs<o to 
the number of cooling rules used. This parameter is defined when the number of 
rules used is greater than 1. 

When the ARMS only uses rules of the type rAS<o, Xe is equal to 0.0. On 
the contrary, if the ARMS uses rules of the type rAS>o and rAs<o with the same 
frequency, Ag is equal to 1.0. Finally, when the ARMS only uses rules of the type 
^AS> 0 t Ag is greater than 1.0. 



Simulation. We confirmed the appropriateness of the Ag parameter through a 
simulation of the ARMS, and verified that the parameter reflects the diversity 
of cycles that are generated by the system. 

Setup. A simulation was carried out under the following environment: In this 
simulation, an ARMS with a free transition is used. We intend to focus on 
qualitative features of the rewriting rule, while not assuming any inputs. A rule 
was selected randomly according to the following protocol: the probability of 
selecting a rAS>o rule is given by the probability 0 < p < 1, while a rAs<o rule 
is selected with probability (1 — p). 

The ARMS is defined as follows; 

r = (A,R), 

A = {a,b}, 

Rules. The length of the left- or right-hand-side of a rule was between one and 
five. Both sides of the rules were obtained by sampling with replacement of 
two symbols a and b. A set of rewriting rules was constructed as the overall 
permutation of both sides of the rules. The number of rules is given by Equation 
(2), where n corresponds to the kinds of symbols that we take and m is the range 
of lengths of strings. 

k=l ^ ' 

As Equation (2) illustrates, if many kinds of symbols are used, the number of 
rules increases rapidly. That is why we use two symbols in this simulation. We 
assume that the length of strings can range between 1 and 5. Then, the number 
of rules for this simulation is equal to 30976. 

Method of Simulation. We assume that the maximal multiset size is 10. At the 
beginning of a simulation, the value of p is set to 0 and it is increased by steps 
of 0.01. At each value of p, 100 new initial states with base number between 1 
and 10 are generated by selecting the symbol a or & randomly. The base number 
of the initial state of a multiset is decided randomly. For each initial state the 
simulation is performed for 1000 steps. 
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Fig. 12. The correlation between the number of terminating calculations and p 



Experimental Results. Let us present the experimental results, focusing on 
the following two points: 

(1) the correlation between the system’s terminating property and the value of 
P, 

(2) the correlation between the diversity of periods of the generated cycles and 
the value of p. 

Termination. Figure 12 illustrates the correlation between p and the number 
of terminating calculations. In Figure 12, the vertical axis corresponds to the 
number of terminating calculations and the horizontal axis corresponds to p. 

In this simulation, before p exceeded 0.1, most calculations terminated. When 
p exceeded 0.1, the number of terminating calculations decreased rapidly, while 
when p was greater than 0.2, this decrease leveled off. Then, for p between 0.3 
and 0.85, only a few calculations terminated, while with p greater than 0.85, the 
number of terminating calculations increased rapidly again. In other words, for 
p near 0 or 1, the system strongly terminated, while with when p away from 0 
or 1, the termination property became weak. 

The Number of Generated Cycles. Figure 13 illustrates the relationship between 
the number of generated cycles and p. 

As p increases, the number of generated cycles also increases rapidly. This 
rise levels off when p exceeds 0.3. For p between 0.3 and 0.8, the number of 
generated cycles remains at the same level, however, when p exceeds 0.75, this 
number rapidly decreased again. 

This result indicates that the termination property indeed is related to the 
number of generated cycles. As we mentioned in the previous section, when p 
was close to 1 or 0, the system strongly terminated while for p away from 0 or 
1, this property became weak. Also, for p near 1 or 0, only a few cycles were 
generated while for p away from 1 and 0, the system generated many cycles. 
We may thus conclude that the degree of termination influences the number of 
cycles generated by the system. 
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Fig. 13. Correlation between the nnmber of generated cycles (vertical axis) and p 
(horizontal axis). 



It is interesting that once the number of generated cycles reaches around 450, 
it remains at the same level, even while p was changed from 0.3 continuously up 
to 0.8. We believe that for p between 0.3 and 0.8, the system is in an equilibrium 
state. 

To investigate the system’s behavior in this equilibrium state, let us examine 
the relationship between the kinds of periods generated by the cycles and the 
value of p. 



The Kinds of Periods Generated by the Cycles. The experimental result indicates 
that even if the system is in an equilibrium state, the kinds of periods are different 
for each value of p. Figure 14 displays the average number of different kinds of 
periods. As we can see in this figure, when p reaches about 0.5, the number 
of different kinds of periods is maximal. In other words, when cooling rules 
and heating rules are used at the same frequency, many kinds of periods are 
generated. 



Discussion. We shall show that the Ag parameter is related to the termination 
property, which implies that Langton’s A parameter is also related to termina- 
tion. 

Before discussing this issue, let us describe cellular automata, Langton’s A 
parameter [36], and Wolfram classes [66] in more detail. Then, we display the 
relation among these parameters and Wolfram classes. 



Cellular Automata. A Cellular Automaton (CA) is a discrete mathematical 
model [65] whose behavior is caused by the interactions among neighboring sites. 
This model has been applied to various fields, for instance, statistical mechan- 
ics, mathematical biology, public hygiene (as a model of infectious disease, for 
example), in medicine and so on. 
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Fig. 14. Correlation between the kinds of periods of generated cycles (vertical axis) 
and p (horizontal axis). 



Formally, a cellular automaton is a D-dimensional lattice automaton. Each 
lattice site is updated by a deterministic rule involving a local neighborhood of 
sites, in discrete time steps. 

The possible state of a site is chosen from the alphabet K = {0, 1, . . . , fc — 1}, 



and the value of site i at step t is denoted by af ^ . 


. At each time step, each site 


value is updated according to the particular neighborhood of N sites around it 


by a local rule 










(3) 


of the form 






— r ) “1 — r-l-1 1 ■ ■ • > 


ai+r^}- 


(4) 


This local rule leads to a global mapping 










(5) 


on all cellular automaton configurations. Then, in 


general 




Qit + l) ^ g Q(t) 


1 


(6) 


where 










(7) 



is the set of configurations generated after t iterated applications of (Wolfram 
[65], Langton [36]). 

For example, a local rule for the one dimensional cellular automaton with 
neighborhood two is: 



000 001 010 Oil 100 101 110 111 
0 1 0 0 1 0 0 0 . 



By using this rule, a step of global mapping is; 



000101011011100 

0100000000001 . 
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Qualitative Characterizations of CA. Wolfram’s classification and Lang- 
ton’s A parameter as qualitative characterizations of the behavior of cellular 
automata are well known. 



Wolfram Classes. Wolfram ( [66]) proposed four classes of qualitative behavior 
patterns of cellular automata based on his investigation of a large sample of CA 
rule tables. He maintained that any cellular automaton would fall into one of 
the four basic classes: 

Class 1: Evolution leads to a homogeneous state. 

Class 2: Evolution leads to a set of separated simple stable or periodic 
structures. 

- Class 3: Evolution leads to a chaotic pattern. 

Class 4: Evolution leads to chaotic, localized structures, sometimes long- 
lived. 

These four classes lead to an analogy between the CA’s behavior and the clas- 
sification of dynamical systems: 

- Class 1: Limit points. 

Class 2: Limit cycles. 

- Class 3: Chaotic behavior of the kind associated with strange attractors. 
Class 4: No direct analogue. 

With respect to Class 4, Wolfram suspected that CA in this class are capable of 
universal computation, so that properties of its infinite-time behavior would be 
undecidable [66]. 



Langton’s A Parameter. Langton criticized Wolfram’s classification by saying: 
“it is obvious that such a classification can only serve as rough approximations 
to the more subtle, underlying structure” (Langton [36], p.46), and proposed 
his A parameter to describe CA rules, to obtain “a deeper understanding of the 
structure of cellular automata rule spaces, one that provides an explanation for 
their relationships to one another” (Langton [36], p. 46). The A parameter is 
defined as follows: 



A = 



-Uq 



(8) 



where K corresponds to the number of symbols used, N to the size of the neigh- 
borhood, and riq to the number of local rules which transform cellular automata 
to the quiescent state, respectively. The quiescent state is picked arbitrarily, and 
is usually associated with a “special” state, such as the “zero” state. If all rules 
transform to the quiescent state, then = Uq, and the the A parameter is 
equal to 0. When no rules transform to the quiescent state on the other hand, A 
is equal to 1. 

The values A = 0 and X = 1 — ^ represent the most homogeneous and the 
most heterogeneous rule tables. As the value of the A parameter increases, the 
dynamical activity of cellular automata becomes chaotic. The correspondence 
between the value of the A parameter and the dynamical activity of cellular 
automata is as follows: 
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fixed point ^ periodic ^ “complex” ^ chaotic. 

In terms of the Wolfram classes, the sequence is: 

Class 1 Class 2 Class 4 Class 3. (9) 

Langton ( [36]) demonstrated that the complex rules are located in between the 
periodic and the chaotic rules, and that there is a clear phase-transition between 
periodic and chaotic behavior. 

Computational Algebraic Characteristics of A Parameter. We demon- 
strate here that the A parameter indicates a degree of termination and that when 
cellular automata yield complex or chaotic behavior, this termination property 
of the system becomes weak. When A is close to 0, the dynamical activity of 
the cellular automaton quickly dies out or reaches a uniform fixed point. In this 
regime, thus, the termination property of cellular automata is strong. As the 
value of the A is increasing, cellular automata evolve to periodic structures or 
chaotic aperiodic patterns. As the value of the A increases even more, the ter- 
mination property becomes weak. The larger the parameter becomes, the later 
the calculation terminates. Since the transitional space of cellular automata is 
finite, if a calculation does not terminate or is difficult to stop, cyclic structures 
must emerge in the process of the computation. 

When A is equal to 0.125, for example, we could find the following rule: 

000 001 010 on 100 loi no in 
0 0 1 0 0 0 0 0 . 

As this parameter increases, the rule might change to that given below: 

000 001 010 on 100 loi no in 
1 0 1 0 0 0 0 1 . 

The A parameter of this rule is equal to 0.375. As the value of A parameter 

increases, the number of rules such as, 

***(* = 0 or 1) 

0 , 

decreases while, on the other hand, the number of rules such as, 

***(* = 0 or 1) 

1 , 

increases. The CA rule yields a complex or chaotic pattern when A is near 1— 
and rules that lead to the quiescent state are used at the same frequency as rules 
that lead away from that state. 

Relation between Ae, A, Wolfram Classes, and the Termination Prop- 
erty. The qualitative dynamics of CA and ARMS that we have described is 
summarized in Table 6. It suggests that each parameter and class indicate a 
degree of terminating. 
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Table 6. Relation among parameters, complexity classes 



termina 

-ting 


Ae 


A 


Wolfram 

classes 


Strong 


Ae 0.0, 

Ae > 1.0 


A 0.0 


I or II 


Fair 

strong 


1.0 > Ae > 0.0, 
Ae > 1.0 


0.0 < A < 1 - T 


II or IV 


Weak 


Ae ^ 1.0 


A-1- A 


III 



Comments on the ‘Edge of Chaos’-Regime. We have demonstrated that 
an equal frequency of rAs>o rules to rAS<o rules yields dynamical patterns for 
both cellular automata and ARMS. We can now see that the principle of “edge of 
chaos” (Langton [36], Kauffman [32]), according to the dynamics just described, 
results from a biased ratio of tas>o rules to rAs<o rules. 

If these two types of rules are used at the same frequency, then the system 
yields a chaotic pattern. However, when the ratio changes slightly, the “edge 
of chaos” regime emerges. Thus, a lack of symmetry in the application of rules 
generates the diversity of cycles in these systems. 

4.2 Markov Analysis and Beyond 

In this section we consider rewriting systems from the viewpoint of stochastic 
transitions of states. Specifically, it is shown that the dynamics of a rewriting 
system, in some situations, can be analyzed as a Markov process. In the limit of 
large system size, one obtains a set of deterministic differential equations, which 
often provides a useful tool, complementary to the computational analysis. We 
also describe what are not assumed and are difficult to examine in the Markov 
analysis, and shall shed light on some unique aspects of ARMS computation. 

As we also said above, a multiset over a set of n objects can be identified with 
a vector in the n-dimensional space. Consider the alphabet A = (oi, • • • , a„); a 
state can be expressed as 

01^102®^ < > X = {xi,X 2 , - ■ ■ ,Xn), (10) 

where all components Xi are integers. 

Each rewriting step in an ARMS is a state transition, which can be described 
as a displacement of the state vector: 

X — > X + r. (11) 

The displacement vector r = (ri, r 2 , • • • , r„) simply represents how the number 
of each object Ui changes in a single rewriting step. 

Consider the Brusselator system from Figure 6, which we shall use as a 
working example throughtout this section. Since we are interested in the behavior 
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of the numbers of X and Y , a state is a two-dimensional vector x = (cc, y) where 
X and y are the numbers of X and Y respectively. The system has four rules, 
namely A ^ X , B + X -^Y + D, 2X + Y iX, and X ^ E, which correspond 
to the four transitions: 



n = (l,0), r 2 = (-l,l), r 3 = (l,-l), r 4 = (-l,0), (12) 



respectively. 

Now it would be natural to assume that in a single step of rewriting rules 
are applied randomly with probabilities depending on the present state and each 
rule. For the above example, a possible model of the transition probabilities 
might be as follows: 

Prob(a: ^ x + r^) = k\ At, 

Prob(a: ^ x + r^) = k 2 X At, 

Prob(a: ^ a: -|- ra) = k^ x'^y At, 

Prob(a; ^ x + r 4 ) = k 4 X At, 

Prob(a; ^ a?) = 1 — (fci -I- k 2 X + k^x^y + kix)At, (13) 

where At is the small interval of time in which a single step of computation takes 
place. For example, the probability of the third rule application is proportional 
to the combinatorial number for encounters of the three chemicals, X, X and Y. 

We should emphasize the fact that it is our assumption that one can treat a 
rewriting system in such a stochastic manner. Especially, note that the order of 
applications of a sequence of rewriting rules is supposed to be not important. Un- 
der this assumption, the transition probabilities depend not on the remote past 
of the history of states but on the state of one-step before. That is, the stochastic 
process is Markovian, and for studying such processes we have abundant tools. 
Let us briefly recapitulate a few results of the Markov analysis in what follows 
(see [31], [27], for example). We shall see how a set of differential equations such 
as Equation (26) shown below, which is often written down rather intuitively 
and immediately from Figure 6 and Equation (13), is derived. 

Suppose that at time to the state is Xq. One can consider the conditional 
probability distribution for later time t > to, P{x,t\xo,to)- Given the transition 
probabilities, one can write the so-called master equation for the conditional 
probability distribution by using the Markov property mentioned above. Using 
the abbreviation, P{x,t) = P{x,t\xo,to), this is 

W{x\z)P{z, t)-J2 W{z\x)P{x, t), (14) 

2 ; 2 : 

where W{x'\x) is the transition probability from x to x' per unit time. 
in Equation (14) represents the summation over all possible transitions except 
z = X. The master equation should be accompanied with appropriate boundary 
conditions such as vanishing probability for a negative number of some Xi. 
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Usually an ARMS is working for a relatively large number of objects, say 
dozens and hundreds. For such a large system, one has an approximate treatment 
of the master equation, namely the system size expansion [31]. 

First, let us have a formal expansion of Equation (14). We use the notation 



t{Ax,x) = W{x + Ax\x), (15) 

then rewrite Equation (14) as 

= Y^[t{r,x- r)P{x -r,t)~ t{r,x)P{x,t)] 
r 

oo . 

= — (-r • Va;)"[t(r,a:)P(a:,t)], ( 16 ) 

r u=i 

where Taylor expansion was done in the second equation, and note that 
means the summation over all possible rules. We also define 



rVx = 



E 



d 

dxi 




(17) 



Henceforth, as shown in the second definition of Equation (17), summation over 
repeated indexes is assumed unless otherwise stated. Up to the second order. 
Equation (16) reads 

[A,(a.)P(a.,t)j + [B,,{x)P{x,t )] , (18) 



where 



Ai{x) = t{r, x), (19) 

r 

B^{x) = ^r,r:it{r,x). ( 20 ) 

r 

We introduce the system size parameter 12, which represents the typical 
number of objects, or chemicals involved in the system. Note that the typical 
size of the jump Axi is some small integers, say ±1, compared with 12. In the 
transition probability expressed in Equation (15), it would be natural to assume 
that the functional form is such that 



t{Ax,x) = [2tp{Ax,x/f2). (21) 

That is, the transition probability is proportional to 12 and is a function of “con- 
centration” x/f2 and displacement Ax. For example, in the Brusselator system 
described by Equation (13), Equation (21) requires an appropriate dependence 
of each ki on 12. 
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It turns out to be useful to express the state variable x as 

X = Q 4> + Vf2 z, 



(22) 



in terms of “deterministic” variable <p and “fluctuation” variable 2 :. In fact, as 
easily verified, under the change of variable by Equation (22), the lowest-order 
(■\/7?) of Equation (16) or Equation (18) is 

^ = ( 23 ) 

and the second-order (17°) reads 

= -d^Md>{t))^^[z,Piz,t)] + Is,, (0(f)) ^^1^, (24) 



where P{x, t) = P{x{z, t),t) = P{z, t). Also defined are Ai{x) = QAi{x/fl') and 
Bij{x) = flBijix j if) in accordance with Equation (21). djAi{-) is the partial 
derivative with respect to the j-th component of the argument of Ai(-). 

Equation (23) gives the differential equation for the deterministic part, and 
Equation (24) describes the stochastic fluctuation around it. The latter would 
simply yield a Gaussian distribution around the deterministic trajectory, whose 
variance changes with time in most cases. A more interesting case is when the 
fluctuation grows rapidly in time, or the so-called anomalous fluctuation, which 
needs alternative treatment rather than that derived here. 

For the Brusselator system, the deterministic part is given by the differential 
equation for the state x = (x,y): 



dx{t) 

dt 

dy{t) 

dt 



= ki — k2 X + ks x^y 
= k2X — kz x^y 



ki X 



(25) 

(26) 



as easily found from Equation (12) and Equation (23). This result is not surpris- 
ing. One could write such a differential equation immediately from Equation (13), 
but rather intuitively. We have seen that it is actually the deterministic part of 
the stochastic process in the limit of large system-size. 

Depending on the paramters ki, Equation (26) has interesting behaviors [43]. 
Let us focus on the limit cycle corresponding to chemical oscillation for illustra- 
tion. Figure 15 shows a set of solutions for the parameters, k\ = 100, = 3, 

k-i = 10“^, /c 4 = 1. On the other hand, in the ARMS formulation, one has the 
result shown in Figure 16 for the same set of parameters with the same initial 
condition (cf. [57]). The simulation was done according to the set of rules in 
Equation (13) {At = 10“° for system sizes of hundreds). Comparison in the 
phase space of {x{t),y{t)) explicitly shows a similar behavior of limit cycle as 
depicted in Figure 17. 

Although the result should be accepted as natural, we think that the Markov 
approach can provide a useful tool and a non-trivial bridge between the com- 
putational approach and the viewpoint of dynamical systems. First of all, the 
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Fig. 15. Limit cycle behavior of Brusselator differential equation. Parameters: fci = 
100, k2 = 3, fcs = 10"^, k4. = 1. 





Fig. 16. Limit cycle behavior of ARMS for the Brusselator system. Parameters: fci = 
100, fcz = 3, fcs = lO’"*, ki = 1. 



comparison of ARMS with dynamical differential equations, such as illustrated 
as above, is practically useful. Several points can be immediately considered 
including 

- phase flow, 

“ qualitative change of phase flow depending on parameters, or bifurcation, 

- stability analysis, 

- asymptotic behavior, 

- analysis of small fluctuation around deterministic trajectories, 

and so on. These points might not be easy to study in the formulation of ARMS, 
and could also be examined from a computational viewpoint. Another interesting 
problem would be to consider the “computational ability” of ARMS from the 
dynamical viewpoint. 
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Fig. 17. Trajectories in the phase space (x{t), y{t)). The bold line and thin line corre- 
spond to the differential equation and to the ARMS simulation respectively. The initial 
values and the parameters are the same in both cases. 



We would, however, like to emphasize that such a Markov analysis as pre- 
sented here is only a complementary approach to understand certain aspects of 
ARMS. In particular, we should point out a few things. 

- The order of successive application of rewriting rules can be important, while 
it is assumed to be completely random in this section. 

- If the size of the system is not very large, say a dozen, one needs to ana- 
lyze the Markov process without a useful approximation, though possible. 
For a not-so-large system, novel behaviors of ARMS are expected rather 
than what are shown in the deterministic approach (for example, anomalous 
fluctuations) . 

- One could consider the case in which the transition probability does not 
obey Equation (21). How to apply a rewriting rule in ARMS is a non-trivial 
problem, and may cause different behaviors. 

A further study including these points is necessary and will shed light on unique 
aspects of ARMS computation. 

5 Application of ARMS for Biological Systems 

In this section, we will address following two applications of ARMS for investi- 
gating biological issues: 

- Origin of Life in terms of a computation. 

- Modeling an Ecological system. 



5.1 Origin of Life 

In order to develop a computational model of the origin of life, we use the concept 
of Chemical Autopoiesis proposed by L.Luisi [37]. 
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Since the seminal work of Oparin [45] the question of the origin of life has 
usually been discussed within the framework of molecular Darwinism. The main 
ingredient of the molecular Darwinism is a mechanism which leads to selection 
in a large variety of structures: starting from small molecules, compounds with 
increasing molecular complexity and with emergent novel properties (binding, 
recognition, catalysis, informations) would have evolved, until the most extraor- 
dinary of emergent properties, life itself, originated. A key step is the emergence 
of self-replication, since accidentally built structures would decay and disappear 
under prebiotic conditions. Based on our present knowledge, we ascribe self- 
replication of DNA and most of all the other functions (catalysis, recognition 
and transport, energy strange, and transduction) to proteins, although this last 
notion has hanged somewhat with the discovery of the catalytic power of RNA. 
Coevolution of nucleic acids and proteins in a system of mutually dependent 
macromolecules, able to replicate, is thus considered as the genesis of life [37]. 



Chemical Autpoiesis. Life, as we know it, is cellular and although in principle 
other life forms are possible, we will ignore them. A cell is characterized, first of 
all, by a boundary which discriminates the “self” from the environment. Within 
this boundary, life is a metabolic network. Based on nutrients coming from the 
outside world, a cell sustains itself by a network of reactions that take place 
inside the boundary and which produce the cell components, including those 
which are assembled as the boundary itself. Thus, the cell is self-generating (i.e., 
cellular components are synthesized by the cell) and self perpetuating (i.e., these 
components are in turn transformed into new compounds, which are part of the 
reaction network that continuously replaces them). 

Maturana and Varela tried to translate these general observations into scien- 
tific definition of “minimal life” [38]. The starting proposition is to define life as 
a unitary operation rather than as a structure. This unitary operation is what 
they have called an autopoetic unit [38] . An autopoetic unit is a system [38] that 
determines its own making, due to a network of reactions which take place within 
its own well-defined boundary. To use Varela’s words: “an autopoetic machine 
continuously generates and specifies its own organization through its operation 
as a system of production of its own components” [38] . 

Let us now turn to consider an autopoetic unit in chemical terms, qualifying 
for example the notion of boundary in terms of chemical structure and then 
qualifying the process of self-generation and self-perpetuation in terms of relative 
rate constants. As already mentioned, the notion of the operational closure is 
central: it refers to the containment of system operations within the system 
boundary. A boundary means a physical berried, which imposes a diffusion step 
on the entering/outgoing metabolites. 

To apply the concepts of autopoesis to concrete chemical systems, it is nec- 
essary to adhere to the original definition of boundary. Accordingly, the term 
boundary will be restricted to a three-dimensionally closed structure. This defi- 
nition eliminates other possible kinds of boundaries, sugh as an interface between 
two phase (liquid/liquid or liquid/solod), and the simple surface of a molecule: 




Abstract Rewriting Systems on Multisets 327 



thus intermolecular binding and recognition of molecules is not per se an autopo- 
etic process. Neither are autopoetic boundaries the surfaces of a growing crystal 
or the contours of cyclic molecules. Rather, vesicles, or micelles, are bounded 
in the autopoetic sense, in that they form tree-dimensionally closed structures 
with a membrane which is, in principle, able selectively to discriminate among 
entering/outgoing molecules. 

Having defined in more chemical terms the notion of boundary, let us now 
consider the simplest possible autopoetic system. A minimal autopoetic system 
would require a boundary composed by at least one component, C; it will be 
characterized by only one entering metabolite. A, and by two unimolecular re- 
actions, a self-generation reaction leading to C (at the expense of A); and a 
decomposition reaction which transforms C into a product P, which then goes 
out of the boundary. These chemical reactions are determined by the bounded 
unit, i.e., the transformation of A into C and of C into P take place only inside 
the boundary; we will assume that such spontaneous reactions are chemically 
irreversible (i.e., that we can neglect the back-reactions). 

Let us also assume, for the sake of simplicity, that all diffusion processes 
throughout the boundary, as well as the rate of self-assembling of C, are very 
fast with respect to the chemical transformations A ^ C and C ^ P. The 
system then produces its own component C, thus regenerating its own structure 

[37]. 

Luisi’s team experimentally implemented this principle in the self-replicating 
micellar system which is based on a reaction between a regent, which is strictly 
localized in the micellar interior, and a co-surfactant. The co-surfactant is present 
in large excess in the bulk, from where it is continuously delivered to the interface 
of the micelle. In this experiment, the reaction takes place at the interface of the 
micelles; the chemical reactivity is a direct result of the presence of a boundary. 
Then they realized an autopoietic situation, in which the product of the reaction 
is the consequence of the boundary constraints of the original structure, and the 
surfactant product then assembles spontaneously in the boundary itself [37]. 

5.2 Artificial Cell Systems 

We can develop a computational model related to Luisi’s chemical autopoesis 
system in terms of P systems, as introduced by Gh. Paun in [49]. Further details 
can be found in [8], [47], [10], [46], [48], etc. In this model, a membrane con- 
tains “chemical compounds” (denoted by symbols) which are processed through 
chemical reactions specific to the region/compartment defined by the membrane. 

P systems were introduced with computability goals. Based on the principles 
outlined above, we develop here an Artificial Cell System (ACS). It consists of 
a multiset of symbols, a set of rewriting rules (reaction rules) and membranes. 

A membrane is an important structure for living systems. It distinguishes 
“self” from its environment and hierarchical structures inside the system (like 
cells, organs and so on) are composed by membranes. Membranes change their 
structure dynamically and constitute a system. We are interested in their dy- 
namical structure in terms of computations. 
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Although not many researches have tackled this topic previously [39], [40], 
the focus of these researches is on the formation of a membrane. The aim of this 
study is to investigate the role of membrane in terms of computations, thus we 
do not treat its formation. 

By simulations, we find that a chemical evolution like a behavior emerges 
and cells evolve to a structure consisting of several cell-like membranes. We 
investigate the correlation among the type of reaction rules (rewriting rules), 
characteristic to a membrane and the evolution of cells, and then find the char- 
acteristics of a membrane effect on its evolution and the Ae parameter to describe 
the correlation. 



The Membrane Structure (MS). To describe the membrane and its struc- 
ture, we proceed as in [49] and we first define the language MS over the alphabet 
{[, ]} whose strings are recurrently defined as follows: 

(1) [,] gM5 

(2) if /ii, ..., fj,n G MS, n > 1, then [^i.../i„] G MS] 

(3) there is nothing else in MS. 

Consider now the following relation on MS: for x,y G MS we write a; ~ y if 
and only if we can write the two strings in the form x = [i...[ 2 ---] 2 [ 3 ---] 3 ---]i) 2/ = 
[i...[ 3 ...] 3 [ 2 ...] 2 -..]i, i.e., if and only if two pairs of parentheses which are not 
contained in one other can be interchanged, together with their contents. 

This is clearly an equivalence relation. We denote by MS' the set of equiva- 
lence classes oi MS with respect to this relation. The elements of MS are called 
membrane structures. 

It is easy to see that the parentheses [, ] appearing in a membrane structure 
are matching correctly in the usual sense. Conversely, any string of correctly 
matching pairs of parentheses [, ], with a matching pair at the ends, corresponds 
to a membrane structure. 

Each matching pair of parentheses [, ] appearing in a membrane structure 
is called a membrane. The number of membranes in a membrane structure /i is 
called the degree of fj, and is denoted by deg(/i). The external membrane of a 
membrane structure fj, is called the vessel; membrane of /r. When a membrane 
which appears in y G MS has the form [ ] and no other membranes appear 
inside the two parentheses then it is called an elementary membrane. 



ACS and ACSE. We will now define two types of ACS; 

(1) ACS, and 

(2) ACS with an Elementary membrane (ACSE). 
Descriptions of an ACS. A transition ACS is a construct 

T= (A,Ai,Mi,...,M„,i?,MC,<f,(T), 



where: 
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(1) A is a set of objects; 

(2) /i is a membrane structure (it can be changed during a computation); 

(3) Ml, M„, are multisets associated with the regions 1,2, ... n of /t; 

(4) i? is a finite set of multiset evolution rules over A. 

(5) MC is a set of membrane compounds; 

(6) 5 is the threshold value of dissolving a membrane; 

(7) a is the threshold value of dividing a membrane; 

/X is a membrane structure of degree n, n > 1, with the membranes labeled in a 
one-to-one manner, for instance, with the numbers from 1 to n. In this way, also 
the regions of /x are identified by the numbers from 1 to n. 

Rewriting rules are applied in following manner: 

(1) The same rules are applied to every membrane. There are no rules specific to 
a membrane (this is a difference from P systems, where the rules are specific 
to membranes, and correspond to the so-called P systems with global rules). 

(2) All the rules are applied in parallel. In every step, all the rules are applied in 
all membranes to all objects to which they can be applied. If there are more 
than two rules that can be applied to an object, then one rule is selected 
randomly. 

(3) If a membrane dissolves, then all the objects in its region are left free in the 
region immediately above it. 

(4) All objects and membranes not specified in a rule and which do not evolve 
are passed unchanged to the next step. 

The set R of rewriting rules is a finite set of multiset rewriting rules over 
A. Both the left and the right side of a rule are obtained by sampling with 
replacement of symbols. A set of reaction rules is constructed as the overall 
permutation of both sides of the rules. 



Input and Output. Chemical compounds are supported from outside of the sys- 
tem to and some compounds are exhausted from the system. All chemical com- 
pounds are transferred among cells, a randomly selected chemical compound is 
sent into the membrane just above or below it. Although a membrane does not 
allow specificity of transport across the membrane, a cell can control its chemical 
environment by chemical reactions. 



Dissolving and Dividing a Membrane of an ACS. A membrane has a contexts 
specified as a multiset of symbols. To maintain a membrane, it needs to have a 
certain minimal volume. A membrane disappears if the volume of its contents 
decreases below the needed volume to maintain it. Dissolving the membrane is 
defined as follows: 



where the ellipsis {...} illustrates chemical compounds inside the membrane. 
Dissolving takes place when 



\Wj\MC 

m 



< 6 
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where <5 is a threshold value for dissolution. All chemical compounds in its region 
are then set free and they are merged into the region immediately above it. 

On the other hand, when the volume of the membrane contents increases to 
a certain extent, then a membrane is divided. Dividing a membrane is realized 
by dividing it in multisets of random sizes. The frequency at which a membrane 
is divided is decided depending on its size. As the size of a multiset becomes 
larger, the cell is divided more frequently. Technically, this is defined as follows: 






Dividing takes place when 



\wh\MC 

\Mh\ 



> a 



where cr is a threshold for dividing the membrane. All chemical compounds in 
its region are separated randomly into new membranes. 



Description of an ACSE. An ACSE is different from an ACS only in the way of 
dividing and dissolving cells. Dissolving the membrane is defined as follows: 



[ha,b,...]h [oa, 6, ...,]o 



Dissolving takes place when 






MC 



\Mh 



<6 



where i5 is a threshold value for dissolving the membrane. All chemical com- 
pounds in its region are then set free and they are merged into the region of 
Mo. 

Dividing is defined as follows; 






Dividing takes place when 



\wh\ 



MC 



\Mh\ 



> cr, 



where cr is a threshold for dividing the membrane. All chemical compounds in 
its region are separated randomly in the old and new membranes. Hence, in an 
ACSE, a structured cell such as [a, b[c, [d, e]]] does not appear. 



Evolution of Cells. When a cell grows and it exceeds the threshold value for 
dividing, it divides into parts of random sizes. This can be seen as a kind of 
mutation. If a divided cell does not have any membrane compounds, it must 
disappear. 

Furthermore, to maintain the membrane through chemical reactions inside 
the cell can be seen as a natural selection. If a cell cannot maintain the membrane 
compounds, it must disappear. 

Thus, both dividing membranes and dissolving membranes produce evolu- 
tionary dynamics. These correspondences are summarized as follows: 
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Natural Selection 


Dissolving a membrane. 


Mutation 


Dividing a cell into parts of 
random size. 



Behavior of ACSE and ACS. In this section, we will show some experimental 
results about ACSE and ACS. 

ACSE. The evolution of elementary cells can be regarded as an approximate 
model of the chemical evolution in the origin of life. 

To check this intuition, the following ACSE was simulated: 

E = {A = {a,6,c},/t = { [,]o,...[,]ioo}Mo = {[a^^ = 

{6},,5 = 0.4,ct = 0.2), 

where: 

(1) R, the length of the left- or right-hand-side of a rule is between one and 
three. Both sides of the rules are obtained by sampling with replacement of 
the three symbols a, b and c; 

(2) The membrane structure are assumed to be = {[i]i..-[ioo]ioo})- 

Through the simulation we discovered that the strength of a membrane affects 
the behavior of cells. The strength of a membrane is defined as the frequency of 
decreasing membrane compounds. 

When a Membrane is Strong. When a membrane is strong, the most stable cell 
consists of only one membrane, cells of this type become “mother” cells and they 
produce “daughter” cells. 

In order to display a state of a cell we represent it by a number by using 
the following transformation function: f {M (a) , M (b) , M (c)) = 10^ x M{a) + 
10^ X M{b) + 10° X M{c). For example, the state {a, a, b, c, c} is transformed into 
10^ X 2 -b IQi X 1 -b 10° X 2 = 212. 

Figure 18 illustrates the evolution of cells when a membrane is strong. 

The cells that are close to the horizontal axis are mother cells. Some daughter 
cells depart from the group and evolve different types of cells, even though almost 
all cells are in the group. In this case, dissolving a membrane compound takes 
place per 100 steps. 

Figure 19 is focused to the mother cells. 

At the beginning there are about ten groups, and some of them become 
extinct: after 200 steps there remain about four groups. 

When a Membrane is Weak. Figure 20 illustrates the case when a membrane is 
weak, and it dissolves every 3 steps. 

In this case, the system cannot form a group of mother cells such as in the 
previous case. Since the group of cells drifts to more stable cells, the cells grow 
larger. Even if a large cell divides into parts of random sizes, the probability 
of including enough membrane compounds to maintain its membrane is larger 
than for a small cell. 
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Fig. 18. When a membrane is strong. The lines illustrate the regions where cells exist 
and points correspond to the state of cells. 




Fig. 19. Evolntion of mother cells. The points correspond to cells. 



We believe that this behaviors of evolution is similar to the evolution of 
viruses [59]. The settings of this simulation are so rough, however, that the 
possibility remains open that chemical evolution in origin of life is similar to 
virus evolution. This issue will be addressed in a future research. 



The Correlation between the Behavior of an ACS 
and the Characteristics of Rewriting Rules 

Description of a Simulation. Next, the following ACS was simulated: 

r = {A = {a, 6, c}, /X = { 0 }, Mo = {a^°, c^O}, R, MC = {b}, 6 = 0.4, a = 

0 . 2 ), 

where R, the length of the left- or right-hand-side of a rule is between one 
and three. Both sides of the rules are obtained by sampling with replacement of 
the three symbols a, b and c and membrane structures are not assumed. 

Ae Parameter. In order to investigate the correlation between the rewriting 
rules and the behavior of the model, we will introduce the Ae parameter [54]. Ae 
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Fig. 20. When a membrane is weak 



indicates the degree of reproduction in a cell. When Ae close to 0.0, the degree 
is quite low and as Ae is getting larger than 1.0, the degree becomes higher. 



Let recall the Ag parameter here. It was defined as follows; 

, _ SrAS>o 

1 + {Stasko — 1 ) 



(27) 



where SrAS>o corresponds to the number of heating rules, and SrAs<o 
to the number of cooling rules. This parameter is well-defined when the 
number of rules is greater than 1. 

When the ARMS only uses rules of the type rAS<o, Ae is equal to 0.0. 
On the contrary, if the ARMS uses rules of the type rAS>o and rAS<o 
with the same frequency, Ae is equal to 1.0. Finally, when the ARMS 
only uses rules of the type rAS>o^ Ag is greater than 1.0. 



Ae Parameter and System’s Behavior. The behavior of ACS is classified 
into four classes by this parameter as follows; 

- Type I: A cell does not evolve and disappears. 

- Type II: The period of dividing membranes and dissolving membranes ap- 
pears cyclically. 

- Type III: A cell evolves to a complex, hierarchically structured cell. 

- Type IV: All chemical compounds inside a cell grow rapidly. However, cells 
hardly divide. 

Each Ae value is not very important. Although they change in the different 
environments, these four classes are unchanged. 

Type I (\e Close to 0.0). When Ag is close to 0.0 the cooling rule is mainly used, 
and cells will hardly grow up. Thus membranes will hardly be divided (figure 
21 ). 
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step state 

0 . 

1. [a^6^c“] 

2 . [a^°y,c^] 

3. [a^^, 6®, c^] 

4. [a^6^c®] 



10. [a^, 6"^, c^] 



16. [a^, 6"^] 

Fig. 21. An example of state transition of ACS: Type I (Ae close to 0.0) 



step state 

0 . 

1. [a^&^c®] 



91. [a^, b'^, c^] 

92. [[b^] [&^cl]] 

93. [[b^] 

94. [a®, b^ ,c^] 



114. [a^&®] 

115. [[a\b^][a\b^]] 

116. [[ a\b^m[a\b^]] 



140. [[[ a\b^][b\cV][a\b^]][b\c^]] 

141. [[a®, c®][a®, c^][a^, c^] 

142. [ a®,c® ] 

Fig. 22. An example of state transition of ACS: Type II (Ae in between 0.5 and ~ 1.0) 



Type II (Xe in between 0.5 and ~ 1.0). When Ag is in between 0.5 ~ 1.0, mem- 
branes are more likely to be divided than when Ag is close to 0.0. But, since 
cooling rules are likely to be used, the membrane compounds do not increase 
very much. Thus, when a cell evolves to a certain size the membrane compounds 
of each cell decrease and they are dissolved (figure 22). Thus there emerges a 
cell cycle behavior. 

Type III (Xe in between 1.05 ~ 2.33). When Ag exceeds 1.0 and the heating 
rule is likely to be used, a cell grows up easier and the frequency of cell division 
becomes high. Thus a cell evolves into a large complex cell (figure 23). 

Type IV (Xe More than 2.5). When the Ag parameter becomes much larger than 
1.0, membranes will hardly be divided, because the number of compounds of all 
kinds in the cell increase and it is difficult to specifically increase the number of 
only membrane compounds. Consequently, a cell will hardly divide (figure 24). 
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step state 
1 r „io rio 



41. 

42. [[6^] [b^c^]] 

43. [[6^] [6^cll] 

44. [[6^] [b^][b\c^]] 

45. [[b\c^][b^,c^][b\c^]] 

46. [[[b^c^][b\c^]][b‘^,c^][b\c^]] 

47. [[[[b^][a\b^]][b\^]][a\b^c^]][a\b‘^,c^ ]] 

140. [[[ai,6i][b\ci]][[62][ai,6i]][fei,ci]] 

214. [[[[«!«, fe^c2][a^6^cl][a^ 6^1] [a\b^]][[[[b‘^][b^][a\b^]][a^b^]] [a^ 6l][fe"][a^ fti] 



Fig. 23. An example of state transition of ACS: Type III (Ae in between 1.05 ~ 2.33) 



step state 

0. [aio,bio,ci°] 

1. [al^b^c®] 

2. [al^b^cl°l 

3. [al^b^c“] 

4. [al^b^cl2l 

5. [a®,6®,ci°] 

6. [a’’,fei°,ci0] 



87. 



147. 



242 



300. [a^5®^c®®] 

Fig. 24. An example of state transition of ACS: Type IV (Ae more than 2.5) 



We believe that the division in four classes describes the basic behavior of 
the system. However, when S or a are changed the Ae value that corresponds to 
each type is also changed. Thus a deeper investigation is needed with respect to 
the correlation between <5, cr and Ag. 

5.3 Genetic ACS (GAGS) 

Since a rewriting rule promotes a reaction, it can be regarded as an enzyme. Here 
we extend ACS with evolutionary mechanism, and we call the system Genetic 
ACS (GACS). 

Descriptions of GACS. A transition GACS is a construct 
r = {A,p,,Mi, ...,Mn,R,5,a), 
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where: 

(1) A is a set of objects; 

(2) /i is a membrane structure (it can be changed during a computation); 

(3) Ml, M„, are multisets associated with the regions 1,2, ... n of /x; 

(4) i? is a finite set of multiset evolution rules over A; 

(5) 6 is the threshold of dissolving membrane; 

(6) a is the threshold of dividing membrane. 

The way of applying rewriting rules, the way of dissolving and dividing, as well 
as the input and output are the same as for ACS and ACSE. 

An Enzyme. We denote a set of reaction rules as follows: 





a 


b 


C 


a 


^aa 


^ab 


^ac 


b 


^ba 


^bb 


^bc 


c 


Xca 


^cb 





where Xij means the number of compounds i which are transformed from j . For 
example, 2ab means a rewriting rule, b ^ a,a. We call the table a transformation 
table. 

Transmission of an Enzyme. When a membrane is divided, the enzyme which 
is inside the membrane is copied and passed down to a new divided cell. At that 
time, a point mutation occurs only in the copied enzyme and it is passed down 
to the new cell. The enzyme remains in the old membrane as well as the new 
one. Point mutations occur every time a membrane divides. When a membrane 
is dissolved the enzyme which is inside the membrane loses its activity. A point 
mutation is a rewriting of the number of Xij. Thus, it changes the number of 
transforming compounds to i. We assumed Xij G {0, 1, 2}. In ACS and ACSE the 
system has only one set of rewriting rules, however, in GACS, each membrane 
has its set of rules (like in P systems). 



An Experimental Result of a GACS. We will discuss some experimental 
results about GACS. Specifically, the following GACS was simulated: 

r = {A = {a, b, c}, n = {0}, Mo = {a^°, b^°, c^O}, R, MC = {c}, ,5 = 0.4, = 

0 . 2 ), 

where the transformation table (R) is as follows: 





a 


b 


c 


a 


Oaa 


0o6 


lac 


b 


Iba 


Obf, 


Obc 


C 


Oca 


Icb 


Occ- 
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The productivity of membrane compounds p is defined as the ratio of the 
total number of non-membrane compounds to be produced to the total number 
of membrane compounds to be produced: 



P = 



yi=b 



When p = 0 the enzyme does not produce any membrane compounds, when 
p = 1, it produces the same number of membrane compounds as the non- 
membrane compounds, and when p > 1, it produces more membrane compounds 
than non-membrane compounds. 

Figure 25 illustrates the time series of productivity, where the vertical axis 
illustrates the productivity, the horizontal axis illustrates the steps and each dot 
is an enzyme. It shows that at first almost all enzymes evolve to p > 1 . However, 
after 100 steps, the productivity of the enzymes decreases. After 100 steps, both 
the number of cells and the sizes of cells increase exponentially. Furthermore, 
the structure of cells becomes complicated. 

Figure 26 illustrates the correlation between the number of cells, the size of 
cells and the number of steps, where each dot corresponds to a cell. Figure 27 
illustrates the internal nodes of the whole system. If we regard the outermost 
membrane as the root and other cells as internal nodes and leaves, we can regard 
the whole system as a tree. In order to indicate the complexity of the tree, we use 
the number of internal nodes in the tree. Figure 27 illustrates that the number 
of internal nodes increases exponentially, after 150 steps. 




Steps 



Fig. 25. Productivity of enzymes. 




Fig. 26. State transition of the system 
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Steps 



Fig. 27. Internal nodes of the cell 



It is interesting that when a cell grows into a hierarchical cell, the enzyme 
evolves to a low productivity one. 

The reason of this behavior can be considered as follows; the enzyme whose 
productivity is high always suffers from mutations, because it promotes mem- 
brane division, it generates more mutations than low productivity ones. If every 
cell is an elementary cell, an enzyme have to keep producing membrane com- 
pounds at a high rate. However, when the cell forms a structure, an enzyme with 
a high productivity is no longer necessary, because if an internal cell dissolves, 
the cell that includes the dissolved cell obtains its membrane compounds. 

Therefore, during the evolution of a cell into a structured cell, the cell needs 
a highly productive enzyme. However, once it evolves a structured cell, highly 
productive enzymes are weeded out. This is the role of membranes in terms of 
computation. 

5.4 Biological Inspired Computational Models 

In this section, we will address computer science aspects of ACS and GAGS. We 
are interested in living systems in terms of computation but not interested in 
solving optimization problems, solving NP-problems, and so on. 

The aim of this study is to create a biological inspired computing model. 
Computational living systems. In such a system, in order to obtain the result, we 
observe the behavior and change the environment conditions, without stopping 
the “computations” . In other words, we steer the system to the right direction 
by changing the environment and lead the system to our settled goal. Although 
such a system may not be suitable to obtain an optimizer, it may be suitable to 
create a system like living systems. This contribution is a preliminary research 
toward this goal. 



Genetic Programming by Using GAGS 

How We Will Compose It? By using GAGS, we attempted to generate a program 
by using a GAGS. In the GAGS, a program corresponds to an enzyme, thus we 
breed an enzyme which can solve a particular problem. We apply this approach 
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to the simple problem of doubling: calculate the double value of the number of 
a and b then show the result as the number of c (c = 2(a + 6)). 

Description of GACS. The used GAGS is defined as follows: 

r = {A = {a, 6,c},^ = {[i]i...[ioo]ioo}, Afo = {[a^, 6^, i?). In the 

initial state, all transformation tables R are 





a 


b 


c 


a 


Oaa 


0o6 


lac 


b 


Iba 


Obb 


Obc 


c 


Oca 


Icb 


Occ? 



and one hundred of elementary cells are assumed inside Mg. No compounds are 
transformed among cells and no input and output are assumed. Although we 
performed simulation by using different types of cells in the initial state, the 
experimental results are the same, thus we will address only the case when in 
the initial state each cell is [of , , c°] . 

Dissolving and Dividing a Membrane. The way of dissolving and dividing are 
the same as in ACSE. After n rewriting steps, if the number of c is smaller 
than 7, the membrane is dissolved and the enzyme inside it loses its activity. For 
example, 

[ha^,b^,c^]h[ia‘^,b^,c^]i [ha^,b^,c\. 

In the above example, the number of c inside the membrane i is smaller than 
7, so the membrane is dissolved. 

After n rewriting steps, if the number of c is larger than 9, the membrane 
is divided and a point mutation takes place in its enzyme. Furthermore, a new 
enzyme is passed down to a new cell. When a cell divided, the inside multiset 
of the divided cell and its parent cell are set to {a^, 6^, c°} again, so they try to 
solve the problem again. The results is: 

[ha^,b^,c^Q]h [hO^,b'^,c°]h[ia^,b'^,c\ 

In the example, because the number of c inside the membrane h is larger than 
9, the membrane h is divided and a new membrane i emerges. Then compounds 
which are inside both membranes i and h are set to {a^,6^,c°}. 

The Fitness of the GAGS. The fitness of an enzyme is defined as the number of 
steps to reach the solution. By using this fitness, good enzymes that can solve 
the problem within a smaller number of steps than the others are selected. 



Experimental Result. At first, all enzymes are set to: 



a 



b, 



c, 



c 



a. 




340 Yasuhiro Suzuki et al. 



After 5000 rewriting steps, the enzymes that reach the solution within 8 steps 
are selected: 



a ^ a,c, 
b ^ a, b, c, c, 
c ^ c, c; 



a ^ a,b, c, c, 
b ^ a,b, c, c, 
c ^ c,c; 



a ^ a,c, c, 
b b, 



c, c. 



For each rule, one hundred of elementary cells that are [a^,&^,c°] are set 
again and calculations are performed again. Next, the enzymes that can solve 
the problem within 5 steps are selected. Then, there remains only one enzyme: 



a a,c, c, 
b ^ c, c, 
c ^ c, c. 

This is a solution of this simulation. In fact, a simpler solution of this problem 
is 



a ^ c,c, 
b ^ c,c, 
c ^ c; 

Thus the survived enzyme evolved to a similar enzyme in the solution. 

This model is a candidate of the computational living system. In the future 
we plan to create a GAGS as an artificial living system of computation that can 
solve more complicated problems. 



5.5 Modeling and Analyzing an Ecological System by Using ARMS 

ARMS can be a tool to model a complex system such as ecological systems, 
social systems and so on. We will consider here an example of application of an 
ARAIS for modeling an ecological system. 

A phenomenon in which plants may respond to herbivore feeding activities 
by producing volatiles that in turn attract carnivorous enemies of the herbivores 
has been reported recently [11] [44] [12]. These volatiles are not the mere result 
of mechanical damage, but are produced by the plant as a specific response to 
herbivore damage. 

In theoretical biological studies on such systems, one notable study was done 
by Sabelis and de Jong [51] who showed that there was a wide range of conditions 
for signaling plants to coexist with plants that spend their enemy in alternative 
ways, the kinds of volatiles become polymorphic within species. Through the 
use of game theory, Sabelis and de Jong showed that ESS corresponds to the 
case when each species of plant produces the volatiles in a polymorphic way. In 
order to investigate the population dynamics of tritrophic systems, we employ 
the abstract rewriting system on multisets, (ARMS) [54] [53] . 
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Implementing an Ecological System by Using the ARMS. In this sec- 
tion, we model the tritrophic system by using ARMS. Let the symbol “a” be a 
leaf, “b” be a herbivore, “d” be a carnivore and “c” be the density of herbivore- 
induced volatiles. Furthermore, we define “e” be an empty state in order to 
introduce the death state. A plant is defined implicitly as a number of leaves. 
Evolution rule R\ is defined as follows: 

a ^ a, a ri (increase in the number of leaves), 
a,b^b,b,c V 2 (herbivore eats a leaf), 
d, b, c ^ d, d T 3 (carnivore catches herbivore), 
d ^ e T 4 (death of a carnivore) , 
b ^ e rs (death of a herbivore) . 

Here, denotes the reaction rate, corresponding to the frequency of 

rule application. For example, when k^ = 0.1 and k^ = 0.2, r^ will be applied 
twice as often as r 4 . 

In this case, k 2 is defined according to the state of multiset, i.e., 
k = 

^ M(a) + M(b) + M(d) ■ 

Rule ri corresponds to the sprouting and growth of a plant, T 2 corresponds to the 
case when a herbivore eats a leaf and the leaf generates volatiles, r^ corresponds 
to the herbivore being preyed on by a carnivore, r 4 corresponds to the death 
of a carnivore and r^ corresponds to the death of a herbivore. More precisely, 
V 2 denotes the case when a leaf exists (a), and a herbivore eats the leaf and 
breeds there (b). The leaf produces volatiles compounds (c) as a result of being 
eaten, so as to attract carnivores. Rule r^ denotes the case when there is a 
herbivore present (b) with the volatiles (c), and a carnivore (d) is attracted by 
the volatiles of r 2 , catches the herbivore and breeds there (dd). The breeding 
rate of the carnivore is expressed as the changing the right-hand side of ds, e.g., 
d,b,c^ d, d, d, d. 

Using this model, we compared the case when leaves generate volatiles with 
the case where they do not. The evolution rules of the system without volatiles 
i ?2 are defined as follows: 

a ^ a, a ke (increase in the number of leaves), 
a,b ^ b,b kr (herbivore eats a leaf), 
d,b^d,d ks (carnivore catches herbivore), 
d ^ e kg (death of a carnivore), 

& ^ e fcio (death of a herbivore). 

We set the reaction rates of ki, k^, fcs, kg, kg and fcio as 0.5, 0.1, 0.1, 0.5, 0.1 
and 0.1, respectively. 
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The number of a, b,c 

a 




Fig. 28. Time course of population size of plants (a), herbivores (b) and carnivores (c) 
= 0.55) 



The number of a, b,c 

a 




Fig. 29. Time course of population size of plants (a), herbivores (b) and carnivores (c) 
(until plants extinction, R 2 ,rg = 0.55) 



Simulation and Results. In both Ri and i?2, the symbiotic relationship con- 
sisting of plants, herbivores and carnivores can be identified. It is interesting that 
i?2 (the relationship under system without volatiles) is likely to collapse under 
the same conditions as for Ri depending on the number of plants, herbivores 
and carnivores in the initial state, and the reaction rate of and rg. 

The time course of population size of plants, herbivores and carnivores of Ri 
(figure 28) is very different from i ?2 (figure 29). 

In Ri there are three attractors (a, b, c); (200, 100, 22)(I), (180, 105, 27)(2) 
and (150, 120, 46) (3). Tritrophic population is first attracted by (1), then (2) 
and finally (3). The progression for i ?2 is attracted to (120, 200, 82), but departs 
from there as the system quickly degenerates. 

These results show that the system with volatile generation by plants is more 
robust than the system without volatile generation under otherwise identical 
conditions. Thus, the role of volatiles warrants further investigation. 



6 Final Remarks 

A. Turing has abstracted the essence of a computation in terms of (human) 
calculation by pen and paper, in this way the notion of a Turing Machine has 
emerged, the main computing model investigated in computer science. However, 
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the model is considered restrictive and many people have tried to consider non- 
Turing computing models. 

It is obvious that living systems never do calculation by using pen and paper. 
Thus, if we could abstract the idea of a computation in terms of living systems, 
then it is possible to reach a the computational principle of emerging hierarchical 
structure “out of nothing” . 

We have not found the principle yet, but we have obtained some hints about 
its basic features. They are: 

(1) Emergence of the hierarchical structure of interactions 

(2) Parallelism. 

Emergence of the Hierarchical Structure of Interactions. We saw that the mem- 
brane structure produces a hierarchical structure. However, in the present model, 
the produced membrane structures do not interact with each other. We must 
consider the framework that allows further interactions among these emergent 
structures. 

Parallelism. In a living system, every cell and every organ is acting in parallel. 
Moreover, if some organs or cells lose their functionality, another organs or cells 
try to increase their activity for recovering/compensate and the system keeps its 
activity. 
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Abstract. This paper is an attempt to summarize the basic elements of 
the multiset theory. We begin by describing multisets and the operations 
between them, then we present hybrid sets and their operations. We 
continue with a categorical approach to multisets, and then we present 
fuzzy multisets and their operations. Finally, we present partially ordered 
multisets. 



1 Introduction 

Many fields of modern mathematics have been emerged by violating a basic prin- 
ciple of a given theory only because useful structures could be defined this way. 
For example, modern non-Euclidean geometries have been emerged by assuming 
that the Parallel Axiom^ does not hold. Similarly, multisets [8,13,16] have been 
defined by assuming that for a given set A an element x occurs a finite num- 
ber of times. Multisets are also known as “bags” (but many consider this term 
too vulgar...), “heap”, “bunch”, “sample”, “occurrence set”, “weighted set” 
and “fireset” — finitely repeated element set. An argument against the position 
that the term “bag” is too vulgar is that this term is a plain English word — 
something in which we put things to carry them around. Besides, in English 
language mathematical literature it is a tradition to use plain words -group, set, 
ring, ... - unlike other sciences, where people invent new long ones by sticking 
Greek and Latin words together. Also, we must note that the term “multiset” 
has been coined by N.G. de Bruijn [8]. The first person who actually used mul- 
tisets is Richard Dedekind in his well-known paper “Was sind und was sollen 
die Zahlen?” ( “The nature and meaning of numbers” ) [6] . This paper was pub- 
lished in 1888. The reader interested to read a rather complete account of the 
development of multiset theory should read Blizard’s excellent survey [5] . 

From a practical point of view multisets are very useful structures arising 
in many areas of mathematics and computer science. The prime factorization 
of an integer n > 0 is a multiset Af whose elements are primes. Every monic 
polynomial f{x) over the complex numbers corresponds in a natural way to the 

* Dedicated to the fond memory of my brother Mikhail Syropoulos. 

^ Which can be stated as follows: Given a point P not incident with line m, there is 
exactly one line incident with P and parallel to m. 



C.S. Calude et al. (Eds.): Multiset Processing, LNCS 2235, pp. 347—358, 2001. 
(c) Springer- Verlag Berlin Heidelberg 2001 
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multiset T of its “roots”. Other examples of multisets include the zeros and 
poles of meromorphic functions, the invariants of matrices in a canonical form, 
the invariants of finite Abelian groups, etc. The terminal strings of a context- 
free grammar form a multiset which is a set iff the grammar is unambiguous. 
Processes in an operating system can be thought of as multisets. The mathe- 
matical treatment of concurrency involves the use of multisets. In social sciences, 
multisets can be used to model social structures, etc. 

There are three methods to define a set and we are recalling them now, since 
they will be heavily used in the rest of the text: 

1. A set is defined by naming all its members {the list method). This method 
can be used only for finite sets. Set A, whose members are ai, 02 , . . . , fln, is 
usually written as 

A = {ai,02, . . . ,a„}. 

2. A set is defined by a property satisfied by its members {the rule method). A 
common notation expressing this method is 

A={x\ P{x)}, 

where the symbol | denotes the phrase “such that”, and P{x) designates a 
proposition of the form “cc has the property P.” 

3. A set is defined by a function, usually called the characteristic function, that 
declares which elements of a universal set X are members of set A and which 
are not. Set A is defined by its characteristic function, XA, as follows: 



Xa{x) 



1,A X G A 
0,if X ^ A 



In what follows we present the definition of multisets and the basic operations 
between multisets. Moreover, we briefly present hybrid sets, i.e., multisets which 
may have negative integers as multiplicities as well as nonnegative integers. Then, 
we proceed with a categorical approach to multisets by defining categories of 
multisets. Next, we present fuzzy multisets and their operations. We finish by 
presenting pomsets and their basic operations. 



2 Multisets and Their Operations 

Ordinary sets are composed of pairwise different elements, i.e., no two elements 
are the same. If we relax this condition, i.e., if we allow multiple but finite 
occurrences of any element, we get a generalization of the notion of a set which 
is called a multiset. 

There are two different kinds of sets with a finite number of repeated ele- 
ments: sets with distinguishable repeated elements, e.g., people sharing a com- 
mon property, and sets with indistinguishable repeated elements, e.g., a “soup” 
of elementary particles. Monro [12] calls the first kind of sets multisets and the 
second multinumhers. However, in order to avoid confusion, we will use the term 
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multiset for Monro’s multinumbers and the term real multisets for Monro’s mul- 
tisets. 

Real multisets and multisets are associated with a (ordinary) set and an 
equivalence relation or a function, respectively. Here are the formal definitions: 

Definition 1. A real multiset X is a pair {X,p), where X is a set and p an 
equivalence relation on X. The set X is called the field of the real multiset. 
Elements of X in the same equivalence class will be said to be of the same sort; 
elements in different equivalence classes will be said to be of different sorts. 

Given two real multisets X = (X,p) and y = (R, cr), a morphism of real 
multisets is a function f : X which respects sorts; that is, if x,x' G X and 
X p x' , then f{x) a f{x'). 

Definition 2. Let D be a set. A multiset over D is just a pair {D, f), where D 
is a set and f : D ^ N is a function. 

The previous definition is the characteristic function definition method for mul- 
tisets. 

Remark 1. Any ordinary set A is actually a multiset {A,xa), where XA is its 
characteristic function. 

Since multisets are sets with multiple but finite occurrences of any element, one 
can define a multiset by employing the list method. However, in order to avoid 
confusion we will use square brackets for multisets and braces for sets. In what 
follows we will employ the most suitable definition method for each case we 
encounter. 

An important notion in set theory is the notion of a subset. Moreover, for 
ordinary sets there are certain operations one can perform between sets, such as 
set intersection, union, etc. We proceed with the definitions of the notion of the 
subset of a multiset and the operations between multisets. 

Definition 3. Suppose that A = {A, f) is a multiset; the subset B of A is called 
the support of A if for every x such that f{x) > 0 this implies that x G B, and 
for every x such that f{x) = 0 this implies that x ft B. 

It is clear that the characteristic function of B can be specified as: 

Xb{x) = min(/(x),l). 



Example 1. Given the multiset A = [a,a,a,b,c,c], then its support is the set 
{a,b,c}. 

Definition 4. Assume that A = {A, /) and B = {A, g) are two multisets. We 
say that A is a sub-multiset of B, denoted AQB if for all a G A we have 



f{a) < g{a). 
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A is called a proper sub-multiset of B, denoted A C B, if in addition for some 
a € A we have 

f{a) < g{a). 

Obviously, it follows that for any two multisets, ^ iff ^ C ,8 and B C A. 

Definition 5. Let A = {A, f) be a multiset; A is the empty multiset if for all 
a € A, f{x) = 0. 

Definition 6. Suppose that A = {A, f) is a multiset; its cardinality, denoted 
card(^), is defined as 

card(^) = 

a&A 

If A is a set, then is the set of all multisets which have A as their support 
set. Moreover, A is the smallest non-empty multiset in in the sense that if 
B G V^, then 

card(8) > card(A). 

We are now turning our attention to the operations between multisets. We de- 
fine, in this order, the sum, the removal, the union and the intersection of two 
multisets. 

Definition 7. Suppose that A = {A, f) and B = {A, g) are two multisets. Their 
sum, denoted A^B, is the multiset C = {A, h) , where for all a G A: 

h{a) = f{a)+g{a). 

It can be easily shown that the multiset sum operation has the following prop- 
erties: 

1. Commutative: A^ B — B\H B] 

2. Associative: (M W 8) l±l C = M l±l (8 1±) C); 

3. There exists a multiset, the null multiset 0 , such that M W 0 = A. 

It is important to note that there exists no inverse and multiset sum is not 
idepotent. 

Definition 8. Suppose that A = {A, f) and B = {A, g) are two multisets. The 
removal of multiset B from A, denoted AqB, is the multiset C = {A, h), where 
for all a G A: 

h{a) = va&yi{^f{a) - g(a),0^ . 



Definition 9. Suppose that A = {A, f) and B = {A, g) are two multisets. Their 
union, denoted A^B, is the multiset C = {A, h), where for all a G A: 

h{a) = ma,^(^f{a),g{a)y 
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Definition 10. Suppose that A = {A, f) and B = {A, g) are two multisets. 
Their intersection, denoted AC\B, is the multiset C = (A,h), where for all 
a e A: 

h{a) = min(^/(a),g(a)y 

The following properties can be easily established for union, intersection, and 
sum of multisets: 

1. Commutativity: AUB — BUA 

Ar]B = BnA; 

2. Associativity: Ai) {B U C) = {AD B) U C 

An{BDC) = {AnB)nC; 

3. Idempotency: AU A = A 

An A = A; 

4. Distributivity: AD {Bn C) = {AD B) n {AD C) 

An{BDC) = {AnB)D{An C); 

AD{BDC) = {ADB)D{ADC) 

AD{BnC) = {ADB)n{AD C); 

6. An{ADB) = A 

Ad {Ad B) = Ad B\ 



7. ADB={ADB)D{AnB). 

Let A = {X, f) be a multiset and B C X. We are interested in forming the 
multiset C = (X,g), where 



g{x) 



f{x), AxeB, 
0, if cc ^ B, 



or, in other words, g{x) = f{x) ■ xb{x). 

A closely related problem is that of forming a multiset by removing all the 
elements from A which are in the set B. That is, we are interested in forming 
the multiset T) = {X, h), where 



h{x) = \ ^ ^ r’ 

^ > \/(a:), lix^B, 



which can be expressed compactly as follows: 



h{x) = f{x) ■ (1 - XB{x)),yx G B. 
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However, one may note that 1 — Xb(x) is the characteristic function of the 
complement set of B, denoted B. So, the previous equation becomes 

h{x) = f{x) ■ XB(x),yx G B. 



Thus, 



V = A®B. 



We shall call the operation A® B multi-intersection. In general, it holds that 
if ^ = (X,xa), a C X, and H is a set, then A® B = AD B. Moreover, the 
following properties hold: 



A® X 
A®tt> 
{AinA2)®B 
{Ai\JA2)®B 



= A 

= 0 

= (Ml ®B)n (M2 @ B) 
= (Ml @ H) U (M2 @ B) 



3 Hybrid Sets 

Hybrid sets and new sets are generalization of multisets and sets respectively. 
In a hybrid set the multiplicity of an element can be either a negative number, 
zero, or a positive number. A new set is to hybrid sets what is a set to a multiset, 
i.e., a special case. We give now the definition of the hybrid set due to Loeb [9]: 

Definition 11. Given a universe U , any function f : U ^ Z, where Z is the 
set of all integers, is called a hybrid set. The value of f{u) is said to he the 
multiplicity of the element u. If f{u) ^ Q we say that u is a member of f and 
we write u G /; otherwise, we write u ^ f. We define the number of elements, 
fff, to he the sum X]«6C/ /(’*^)- / ^^.id to be an fff (element) hybrid set. 

Hybrid sets are denoted by employing the list method and by inserting a bar 
to separate elements with negative multiplicity from those with a non-negative 
multiplicity. Elements occurring with a positive multiplicity are written on the 
left of the bar, and elements occurring with a negative multiplicity are written 
on the right of the bar. 

Example 2. li f = {a, b,b \ d, e, e} is a hybrid set, then /(a) = 1, f{b) = 2, 
f{d) = -1, and /(e) = -2. 

The empty hybrid set, denoted 0, is the unique hybrid set for which all elements 
have multiplicity equal to zero. We can specify the empty hybrid set by {|}. The 
definition of subset-hood in the case of hybrid sets is also due to Loeb: 

Definition 12. Let f and g he hybrid sets. We say that f is a subset of g and 
that g contains f and we write f Q g if either f{u) g{u) for all u G U, or 
g{u) — f{u) <C g{u) for all u G U, where is a partial ordering of the integers 
defined as follows: i j iff i < j and either i < 0 or j > 0. 
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We proceed now with the definition of the various operations between hybrid 
sets. 

Definition 13. Assume that f and g are two hybrid sets over the same 
universe U. Then, their intersection, f C) g, is the hybrid set h, such that 
h{u) = max{f{u),g{u)), their union, fUg, is the hybrid set h, such that h{u) = 
min(/(u), g{u)), and their sum, f ^ g, is the hybrid set h, such that h{u) = 
f{u)+g{u). 

We easily verify the correctness of these definitions by using the definition of 
subset-hood. 

4 Categorical Models of Multisets 

Let C be a category. A functor E : C°p ^ Set is called a presheaf on C. Thus, 
a presheaf on C is a contravariant functor. The presheaves on C with natural 
transformations as arrows form a category denoted Psh(C). Suppose that C is a 
set, i.e., a discrete category; then, the presheaf F : C ^ Set denotes a multiset, 
since F{c) is a set whose cardinality is equal to the number of times c occurs 
in the multiset. So, for any set C, the category Psh(C) denotes the category of 
all multisets of C. These remarks lead us to the definition of a category of all 
possible multisets: 

Definition 14. Category MSet is a category of all possible multisets. 

1. The objects of the category consist of pairs {A,P), where A is a set and 
P : A ^ Set a presheaf on A. 

2. If{A,P) and (B,Q) are two objects of the category, an arrow between these 
objects is a pair (/, A), where f : A ^ B is a function and X : P ^ Q o f is 
a natural transformation, i.e., a family of functions. 

3. Arrows compose as follows: suppose that {A, P) {B, Q) and {B, Q) 

(C, R) are arrows of the category, then (/, A) o (g, g,) = (g o f, g x A), where 
g o f is the usual function composition and g x X \ P ^ Ro {g o f). 

4-. Given an object {A,P), the identity arrow is (id^,idp). 

The last part of the definition is a kind of wreath product (see [4]). However, 
it is not clear at the moment how this definition fits into the general theory of 
wreath products. 

This is not the only way one can categorically define multisets. Suppose 
that F : A ^ Set is a presheaf and that A is a set. Then if we form the set 
X = Aj, where Xi = F{i), we can define the function p ■. X ^ A. This 
function is equivalent to the presheaf F. Moreover, p~^{a), i.e., the preimage of 
p, is the set of copies of a in the multiset. Now, we can define another category 
of all possible multisets: 

Definition 15. Category Bags is a category of all possible multisets. 

1. The objects of the category consist of pairs (A,p), where p : UieA Al. 




354 Apostolos Syropoulos 



2. An arrow between two objects {A,p) and (B,q) is a pair {f,g), where f : 
A ^ B and g : X ^ Y , such that the following diagram commutes: 

9 



X 



A 



Y 



B 



3. Suppose that (A,p) {B,q) and (B,q) ^'La) (^C\r) are two arrows. Then 



(/, g) o (/', g') = (/' o f,g' o g) such that in the following diagram 




the outer rectangle commutes iff the inner squares commute. 

4 . Given an object (A,p) the identity arrow is (idyi,idx)- 

It is obvious that categories MSet and Bags are equivalent. Moreover, one can 
study the properties of the categories MSet and Bags, but we feel this is not 
the appropriate place for such a presentation. We now investigate the way one 
can embed the category Bags into a Chu category [15]. 

Given an arbitrary object _L in a category A, we construct the category 
Chu(A, _L) as follows: 

1. The objects of Chu(A,_L) consist of triplets (Ai,r, A2), where Ai, A2 are 
objects in A and r : Ai 0 A2 ^ _L is an arrow in A. 

2 . An arrow from (Ai,r, A2) to (Si,s,i?2) is a pair (/,/), where f : Ai ^ Bi 
and f : B2 ^ A 2 are arrows in A such that the square 

/ G ids2 

Ai G B2 Bi G B2 



idyll ® / 



Ai G A 2 ► _L 

r 

commutes. 

3 . Arrow composition is defined pairwise. 

If A is any ^-autonomous category (see [1,2]), then Chu(A,_L) is A^, where _L 
is a dualizing object [ 3 ]. It is now easy to define a full embedding of Bags into 
Chu(Rel, 1), where Rel is the category of sets and binary relations between 
them and 1 is any singleton set. 
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We start by defining the object part of the functor: 

Definition 16. (Object part) Functor Ai maps each object {A,p) o/Bags into 
the Chu space (A,p,X), where X = domp, and p is the relation obtained from 
function p and a p Xi iff the multiplicity of a is equal to the cardinality of Xi. 

We now proceed with the arrow part of the functor: 

Definition 17. (Arrow part) Let {A,p) and (B,q) be two objects of the cate- 
gory Bags. Moreover, suppose that (A,p) {B,q) is an arrow between these 

objects; then AA{f,g) = (f,g~^), where f is the relation obtained from function 
f and g~^ the inverse of the relation obtained from function g. 

The possible consequences of this embedding are explored in [15]. 

5 Fuzzy Multisets 

Fuzzy set theory has been introduced as a means to deal with vagueness in 
mathematics. The theory is well-established and we will not get into the trouble 
of presenting it. We just note that fuzzy set theory was an attempt to develop a 
formal apparatus to involve a partial membership in a set, mainly to arm people 
in the modeling of empirical objects and facts. In other words, fuzzy set theory 
is, sort to say, a generalization of the notion of set membership. 

Definition 18. Suppose that X is a set. Any function A : X ^ I, where I = 
[0, 1], is called a fuzzy subset of X. Function A is usually called the membership 
function of the fuzzy subset A. 

Fuzzy multisets have been introduced by Yager [16] and have been studied by 
Miyamoto [10,11] and others. A fuzzy multiset of some set X is just a multiset 
of A X I. We are now defining summation of fuzzy multisets: 

Definition 19. If A = (A x I, /) and B = {X x I, g) are two fuzzy multisets, 
then their sum, denoted B, is the fuzzy multiset C = {X x 1, h) , where for 
all {x, px) G A X I.- 

h{x, px) = f{x, px) + g{x, px)- 

As in the case of crisp^ multisets, there is more that one way to define a fuzzy 
multiset. In order to define the basic operations between fuzzy multisets, we de- 
fine fuzzy multisets by the list method. If A= {{xi, be a fuzzy mul- 

tiset, then we can write the same set a,s A = {{/in, • ■ • , /ii^jl/xi, . . . , {/i„, . . . , 
p.n£n}/xn}. Note that {/ill, . . . , /ii^j} is actually a multiset of I. Next, we re- 
arrange the multisets {pn, . . . , p,ii„} so that the elements appear in decreas- 
ing order. Finally, we need to add zeroes so that the length of all multisets 
{/ill, ■ • • , P'lii } is the same. This representation is called the graded sequence. To 
make things clear we give an example: 

In fuzzy set theory the term crisp is used to characterize anything that is non-fuzzy. 



2 
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Example 3. Let 

A = [(a, 0.2), (6, 0.5), (6, 0.1), (a, 0.2), (a, 0.3), {d, 0.7)] 

be a fuzzy multiset. Then its graded sequence is 

A = [{0.3, 0.2, 0.2}/a, (0.5, 0.1, 0}/6, (0.7, 0, 0}/d] 

In case we want to perform certain operations on two or more fuzzy multisets, 
all multisets {/in, . . . ,/ii^i} must have the same length. Moreover, even if one 
fuzzy multiset does not contain an element c we must add an entry of the form 
{0, 0, . . . , 0 /c}, where p is the length of all other multisets. We are now giving 

p times 

the definitions of the various operations between fuzzy multisets: 

Definition 20. Assume that A = [[/iip, . . . , /inj/xi, . . . , [unp, ■ ■ ■ , /inij/a^n] arid 
^ = [[dip, ■■■, d'ii]/xi , . . . , [/i'„p, . . . , d'ni]/^n] are two fuzzy multisets; then 

1. AC B iff for every Xi, /i^ < ffj, j = l,...,p. 

2. A = B iff for every Xi, fHj = ffj, j = l,...,p. 

3. C = ACB, where C = [[fffp, . . . , /inj/xi, . . . , [/r"p, . . . , /i"i]/x„] iff for every 

Xi, /i" = fiij), j = l,...,p. 

4- C = AnB, where C = [[ff(p, y-'(i]/xi, [fffp, . . . , /i"i]/x„] iff for every 
Xi, ffh = min(/ib, /iy), j = 1, . . . ,p. 

When the functions max and min are replaced by a t-norm t and a t-conorm s 
respectively, we obtain the definitions for n* and Ug, respectively. The union and 
intersection of arbitrary fuzzy multisets A, B, and C satisfy the following laws: 

1. Commutativity. AUB = BUA 

Adb^bda 

2. Associativity. {AC B) C C = AC {B C C) 

{ACB)CC = AC{BCC) 

3. Distributivity. {AC B) C\ C = {AC\ B) C {AC\ C) 

{Ar\B)\JC = {AcB)r\{A\JC) 

Next, we define the a-cut for fuzzy multisets. We first recall the notion of 
the a-cut for a fuzzy set: 

Definition 21. Let U he a set, let C be a partially ordered set and let A : U ^ 
C. For a G C, the a-cut of A, is a) = {u GU \ A{u) > a}. The subset of 

U will he denoted by Aa- 

Definition 22. Assume that A = {X x I, /) is a fuzzy multiset, and that a G 
(0, 1]. Then Aa = {X,f), i.e., the a-cut of A, is a multiset such that 

f{x)= f{x,Hx)- 

p.x>a 
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Consequently, the a-cut of a fuzzy multiset is just a multiset. 

Given a fuzzy multiset A = {X x I, h) and function f : X ^ Y we can define 
two images: 



f[A] = l±l,,6^{/(a;)} 
f{A)= U{/(^)} 

xeA 

Note that in case the fuzzy multiset is just a fuzzy subset, the second image 
corresponds to the extension principle of fuzzy set theory. 

6 Partially Ordered Multisets 

Partially ordered multisets (or just pomsets) have been used by Pratt [14] as a 
means to model concurrency. In this model a process is a set of pomsets. Here 
we will only present the definition of a pomset and the basic operations between 
pomsets. The reader interested in learning more on their use on modeling con- 
currency is refereed to Pratt’s paper. The following definition of pomset is due 
to Gischer [7] and is copied verbatim from Pratt’s paper: 

Definition 23. A labeled partial order (Ipo) is a 4~tuple {V,S,<,p) consisting 

of 

1. a vertex set V, typically modeling events; 

2. an alphabet S (for symbol set), typically modeling actions such as the arrival 
of integer 3 at port Q; 

3. a partial order < on V , with e < f typically being interpreted as event e 
necessarily preceding event f in time; and 

4- a labeling function n : V ^ E assigning symbols to vertices, each labeled 
event representing an occurrence of the action labeling it, with the same 
action possibly having multiply occurrences, that is, /i need not be injective. 

A pomset is then the isomorphism class of an Ipo, denoted \V,E,<,pi]. 

Now we are ready to define the basic operations between pomsets: 

Definition 24. Assume that p = [V,S,<,p] and p' = [V , E' , , p.'] are two 

pomsets. Then: 

1. their concurrence p\ \p' is the pomset \V U V , E U E' , < U <', p,U pf], where 
V and V are assumed to be disjoint; 

2. their concatenation p; p' is as for concurrence except that instead o/ < U <' 
the partial order is taken to be <\J <' U(P x V); and 

3. their orthocurrence p x p' is the pomset [V x V', E x E' , < x <',p x p). 
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